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TARGETING ADENOVIRUS WITH USE OF 
CONSTRAINED PEPTIDE MOTIFS 

TECHNICAL FIELD OF THE INVENTION 
5 The present invention pertains to a chimeric 

adenovirus fiber protein comprising a constrained 
nonnative amino acid sequence. The nonnative amino acid 
sequence .encodes a peptide motif that comprises an 
epitope for an antibody, or a ligand for a cell surface 
10 receptor, that can be employed in cell targeting. The 
present invention also pertains to vectors comprising 
such a chimeric adenovirus fiber protein, and to methods 
of using such vectors. - 

15 BACKGROUND OF THE INVENTION 

Despite their prior poor reputation as major 
pathogenic agents that lead to numerous infectious 
diseases, adenoviruses (and particularly, replication- 
deficient adenoviruses) have more recently attracted 

20 considerable recognition as highly effective viral 
vectors for gene therapy. Adenoviral vectors offer 
exciting possibilities in this new realm of therapeutics 
based on their high efficiency of gene transfer, 
substantial carrying capacity, and ability to infect a 

25 wide range of cell types (Crystal, Science, 270, 404-410 
(1995); Curiel et al . , Human Gene Therapy , 3, 147-154 
(1992); International Patent Application WO 95/21259). 

Due to these desirable properties of adenoviruses, 
recombinant adenoviral vectors have been used for the 

30 cell-targeted transfer of one or more recombinant genes 
to diseased cells or tissue in need of treatment. In 
terms of the general structure of an adenovirus, under 
the electron microscope, an adenovirus particle resembles 
a space capsule having protruding antennae (Xia et al., 

35 Structure , 2, 1259-1270 (1994)),. The viral capsid 

comprises at least six different polypeptides, including 
240 copies of the trimeric hexon (i.e., polypeptide II) 
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and 12 copies each of the pentameric penton (polypeptide 
III) base and trimeric fiber (Xia et ai . , supra ) . 

An adenovirus uses two separate cellular receptors, 
both of which must be present, to attach to and infect a 
5 cell (Wickham et al.. Cell , 73, 309-319 (1993)). First, 
the adenovirus fiber protein attaches the virus to a cell 
by binding to an as yet unidentified receptor. Then, the 
penton base binds to integrins, which are a family of 
heterodimeric cell-surface receptors that mediate 

10 cellular adhesion to the extracellular matrix molecules, 
as well as other molecules {Hynes, Cell , 69 , 11-25 
(1992)). Once an adenovirus is attached to a cell, it 
undergoes receptor-mediated internalization into 
clathrin-coated endocytic vesicles and is stepwise 

15 stripped down to the viral double-stranded genome, and 
then the genome (and some accompanying viral components) 
subsequently is transported to the cell nucleus, thus 
initiating infection (Svennson et al., J. Virol. , 51, 
687-694 (1984); Chardonnet et al.. Virology , 40, 462-477 

20 (1970); Greber et al.. Cell , 75, 477-486 (1993); 
Fitzgerald et al.. Cell , 32, 607-617 (1983)). 

The fiber monomer consists of an amino terminal tail 
(which attaches noncovalently to the penton base), a 
shaft (whose length varies among different virus 

25 serotypes), and a carboxy terminal globular knob domain 
(which is necessary and sufficient for host cell binding) 
(Devaux et al., J. Molec. Biol. , 215 , 567-588 (1990); Xia 
et al., supra ; Green et al , , EMBO J . , 2, 1357-1365 
(1983); Henry et al . , J. Virology , 68 (8) , 5239-524 6 

30 (1994)). The regions necessary for trimerization of 
fiber (which is required for penton base binding) also 
are located in the knob region of the protein (Henry et 
al. (1994), supra ; Novelli et al.. Virology , 185 , 365-376 
(1991)). The fiber, together with the hexon, determine 

35 the serotype specificity of the virus, and also comprise 
the main antigenic determinants of the virus (Watson et 
al., J. Gen. Virol. , 69, 525-535 (1988)). 
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This ability of adenoviral fiber and hexon protein 
to act as targets for a host immune response initially 
hampered attempts at adenoviral-mediated gene therapy. 
Namely, alterations in gene expression mediated by 
5 adenovirus are not permanent since the vector is not 

stably maintained. However, following adenoviral vector 
re-administration to prolong the therapeutic response, 
neutralizing antibodies can be raised against the 
adenoviral fiber and/or hexon proteins, thus 

10 circumventing protein production (Wohlfart, J. Virology , 
62, 2321-2328 (1988); Wohlfart et al . , J. Virology , 56, 
896-903 (1985)). Fortunately, such an immune response 
will not be generated with all uses of adenoviral 
vectors. Similarly, it is now known that if the presence 

15 of such neutralizing antibodies impedes adenoviral- 
mediated intracellular delivery, another adenoviral 
vector, e.g., another serotype adenoviral vector, or 
another adenovirus vector lacking the epitope against 
which the antibody is directed, can be employed instead 

20 (Crompton et al., J. Gen. Virol. , 75, 133-139 (1994)). 
Moreover, newer and effective techniques are constantly 
emerging to prevent an antibody response against the 
virus from precluding effective re-administration of an 
adenoviral vector (see, e.g.. International Patent 

25 Application WO 96/12406;. Mastrangeli et al.. Human Gene 
Therapy , 7' 79-87 (1996)). 

Thus, adenoviral-mediated gene therapy continues to 
hold great promise, in particular, with respect to 
redirecting adenovirus tropism. Namely, even though 

30 adenovirus can enter an impressive variety of cell types 
(see, e.g., Rosenfeld et al.. Cell , 68, 143-155 (1992); 
Quantin et al., Proc. Natl. Acad. Sci . , 89 , 2581-2584 
(1992)); Lemarchand et al, Proc. Natl. Acad. Sci. , 89, 
6482-6486 (1992); Anton et al., J. Virol. , 69, 4600-4606 

35 (1995); LaSalle et al.. Science , 259 , 988-990 (1993)), 
there still appear to be cells (e.g., lymphocytes) which 
are not readily amenable to adenovirus-raediated gene 
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delivery (see, e.g., Grubb et al.. Nature , 371 , 802-806 
{1994); Dupuit et al.. Human Gene Therapy , 6, 1185-1193 
(1995); Silver et al.. Virology 165. 377-387 (1988); 
Horvath et al., J. Virol. , 62 (1) , 341-345 (1988)). 
Similarly, even when targeting to cells that readily are 
infected by adenovirus, in many cases, very high levels 
of adenovirus particles have been used to achieve 
transduction. This is disadvantageous inasmuch as any 
immune response associated with adenoviral infection 
necessarily would be exacerbated with such high levels. 

Accordingly, researchers are seeking new ways to 
selectively introduce adenoviruses into cells that cannot 
be infected by adenoviruses, and to increase the 
effectiveness of adenoviral delivery into cells that are 
infected by adenoviruses. The general principle of 
redirecting adenovirus tropism is straightforward. In 
one common approach, by incorporating peptide binding 
motifs into an adenovirus coat protein such as fiber 
protein, the virus can be redirected to bind a cell 
surface binding site that it normally does not bind (see, 
e.g., Michael et al . , Gene Therapv . 2, 660-668 (1995); 
International Patent Application WO 95/26412; 
International Patent Application WO 94/10323; 
International Patent Application WO 95/05201) . A peptide 
binding motif is a short sequence of amino acids such as 
an epitope for an antibody (e.g., a bispecific antibody), 
or a ligand for a ceil surface binding site (e.g., a 
receptor), that can be employed in cell targeting. When 
the peptide motif binds, for instance to its 
corresponding cell surface binding site to which 
adenovirus normally does not bind, or binds with only low 
affinity, the adenovirus carrying the peptide motif then 
can selectively deliver genes to the cell comprising this 
binding site in a specific and/or more efficient manner. 

However, simply incorporating a known peptide motif 
into the fiber protein of an adenovirus may not be enough 
to allow the virus to bind and effectively transduce a 


wo 98/07865 PCT/US97/14719 

5 

target cell. The effectiveness of the peptide motif in 
redirecting virus binding to a new cell surface binding 
site depends on multiple factors, including the 
availability of the peptide motif to bind to the cell 
5 surface receptor, the affinity of the peptide motif for 
the cell surface binding site, and the number of target 
binding sites (e.g., receptors) present on the cell 
targeted for gene delivery. While the lattermost factor 
currently cannot be manipulated, in in vivo applications, 

10 the former two would appear to present areas for 
improvement of prevailing adenoviral-mediated gene 
therapy. For instance, earlier researchers have not 
considered that if the peptide motif is buried within the 
structure of the fiber protein,' and/or masked by the 

15 surrounding structure of the protein, the peptide motif 
will not be able to interact with and bind its target. 
Similarly, previous researchers have not addressed that 
it is the affinity of the peptide motif for the cell 
surface binding site (e.g., receptor) which determines 

20 how efficiently the virus can initiate and maintain a 
binding contact with the target receptor, resulting in 
cell infection/transduction. 

Thus, there remains a need for improved methods of 
cell targeting, and adenoviral vectors by which this can 

25 be accomplished. The present invention seeks to overcome 
at least some of the aforesaid problems of recombinant 
adenoviral gene therapy. In particular, it is an object 
of the present invention to provide improved vectors and 
methods for cell targeting through provision of a 

30 chimeric adenovirus fiber protein comprising a 

constrained peptide motif. These and other objects and 
advantages of the present invention, as well as 
additional inventive features, will be apparent from the 
following detailed description. 

35 


. BRIEF SUMMARY OF THE INVENTION 
The present invention provides a chimeric adenoviral 
fiber protein which differs from the wild-type (i.e., 
native) fiber protein by the introduction of a nonnative 
5 amino acid sequence in a conformationally- restrained 

(i.e., constrained) manner. The introduction results in 
the insertion of, or creation of, a constrained peptide 
motif that confers upon the resultant chimeric adenovirus 
fiber protein an ability to direct entry into cells of a 
10 vector comprising the chimeric fiber protein that is more 
efficient than entry into cells of a vector that is 
identical except for comprising a wild-type adenovirus 
fiber protein, and/or an ability to direct entry into 
cells that adenovirus comprising the wild-type fiber 
15 protein typically does not infect/transduce. The present 
invention also provides vectors that comprise the 
chimeric adenovirus fiber protein, and methods of 
constructing and using such vectors. 

2 0 BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a diagram that illustrates the method of 
the invention of targeting adenovirus by conformationally 
restraining a nonnative amino acid sequence in an exposed 
loop of the fiber knob. to comprise a peptide binding 

25 motif. 

Figure 2 is a diagram that illustrates the method of 
the invention of targeting adenovirus by incorporating a 
conformationally restrained nonnative amino acid sequence 
(i.e., a sequence comprising a nonpreexisting loop) into 
30 the C-terminus of the fiber - protein to comprise a peptide 
binding motif. 

Figure 3 is a diagram that depicts the plasmid 
pl93(F5*) used to construct adenovirus fiber chimeras. 

Figure 4 is a diagram that depicts the plasmid pl93 
35 F5F2K, which encodes a chimeric fiber protein. 

Figure 5 is a diagram that depicts the plasmid pl93 
F5F2K (RKKK2) , which encodes a chimeric adenovirus fiber 


W0 9«W»7M5 PCT/US97/147t9 

7 

protein comprising the heparin binding domain (i.e., 
RKKKRKKK, or Arg Lys Lys Lys Arg Lys Lys Lys (SEQ ID 
N0:1]) in the exposed HI loop of the Ad2 fiber knob. 

Figure 6 is a diagram that depicts the plasmid pl93 
5 F5F2K{FLAG}, which encodes a chimeric adenovirus fiber 
protein comprising the FLAG epitope (i.e., DYKDDDDK or 
Asp Tyr Lys Asp Asp Asp Asp Lys [SEQ ID N0:2]) in the 
exposed HI loop of the Ad2 fiber knob. 

Figure 7 is a bar graph depicting p-galactosidase 
10 expression (% of control) in 293 cells transduced with 
either AdZ . F5F2K {RKKK2 ) (closed bars) or AdZ (open bars) 
in rhe absence (control) or presence (fiber) of soluble 
fiber protein. 

Figure 8 depicts the transfer plasmid pl93(F5)RGD, 
15 which was used to create the adenovirus vector AdZ.RGD. 

Figure 9 depicts the- transfer plasmid pl93 { F5 ) pLDV, 
which was used to create the adenovirus vector AdZ.pLDV. 

Figure 10 depicts the transfer plasmid pl93(F5) 
pYIGSR, which was used to create the adenovirus vector 
20 AdZ.-pYIGSR. 

Figure 11 is a graph of days post-infection versus 
FFU/cell for 293 cells infected with AdZ (open circles) 
or AdZ.RGD (closed squares). 

Figure 12 is a graph of virus particles added (per 6 
25 cm plate) versus p-galactosidase expression (RLU/0.3 ^1/7 
minutes) for A549 cells infected with AdZ (closed 
circles) or AdZ.RGD (closed triangles) . 

Figure 13 is a graph of virus particles added (per 6, 
cm plate) versus p-galactosidase expression (RLU/0.3 nI/7 
30 minutes) for CPAE cells infected with AdZ (closed 
circles) or AdZ.RGD (closed triangles) . 

Figure 14 is a graph of virus particles added (per 6 
cm plate) versus p-galactosidase expression (RLU/0.3 nl/7 
minutes) for HISM cells infected with AdZ (closed 
35 circles) or AdZ.RGD (closed triangles) . 

Figure 15 is a bar graph depicting the binding of 
AdZ.RGD (closed bars) and AdZ (open bars) expressed as % 
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of input of cell-bound vector, in 835 kidney cells in 
either the absence (control) or presence of competing 
fiber proteiin (F5), penton base protein (PB), or both 
fiber and penton base protein (F5/PB) . 
5 Figure 16 is a bar graph depicting the binding of 

AdZ.RGD (closed bars) and AdZ (open bars) expressed as % 
of input of cell-bound vector in AlO smooth muscle cells 
in either the absence (control) or presence of competing 
fiber protein (F5), penton base protein (PB) , or both 

10 fiber and penton base protein (F5/PB) . 

Figure 17 is a bar graph depicting the binding of 
AdZ.RGD (closed bars) and AdZ (open bars) expressed as % 
of input of cell-bound vector in CPAE endothelial cells 
in either the absence (control) or presence of competing 

15 fiber protein (FS), penton base protein (PB) , or both 
fiber and penton base protein (F5/PB) . 

Figure 18 is a bar graph depicting p-galactosidase 
expression (% of control) in A549 cells transduced with 
either AdZ.pYIGSR (closed bars) or AdZ (open bars) in the 

20 absence (control) or presence (fiber) of soluble fiber 
protein. 

Figure 19 is a bar graph depicting p-galactosidase 
expression (% of control) in Ramos cells transduced with 
either AdZ. pLDV (closed bars) or AdZ (open bars) in the 

25 absence (control) or presence (fiber) of soluble fiber 
protein, or fiber protein and EDTA (fiber + EDTA) . 

Figure 20 is a bar graph depicting p-galactosidase 
expression (% of control) in 293 cells transduced with 
either AdZ.RGD (closed bars), AdZ.pRGD (stippled bars), 

30 or AdZ (open bars) in the absence (control) or presence 
(fiber) of soluble fiber protein. 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention provides, among other things, 
35 a recombinant adenovirus comprising a chimeric fiber 
protein. The chimeric fiber protein comprises a 
constrained nonnative amino acid sequence, in addition 
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to, or in place. of, a native amino acid sequence. This 
nonnative amino acid sequence allows the chimeric fiber 
(or a vector comprising the chimeric fiber) to more 
efficiently bind to and enter cells. 

5 

Chimeric Adenovirus Fiber Protein 

A "fiber protein" according to the invention 
preferably comprises an adenoviral fiber protein. Any 
one of the serotypes of human or nonhuman adenovirus (as 

10 described later in the context of the vector comprising a 
chimeric fiber protein) can be used 'as the source of the 
fiber protein or fiber gene. Optimally, however, the 
adenovirus is an Ad2 or Ad5 adenovirus. 

The fiber protein is "chimeric" in that it comprises 

15 amino acid residues that are not typically found in the 
protein as isolated from wild-type adenovirus (i.e., 
comprising the native protein, or wild-type protein) . 
The fiber protein thus comprises a "nonnative amino acid 
sequence". By "nonnative amino acid sequence" is meant a 

20 sequence of any suitable length, preferably from about 3 
to about 200 amino acids, optimally from about 3 to about 
30 amino acids. Desirably, the nonnative amino acid 
sequence is introduced into the fiber protein at the 
level of gene expression (i.e., by introduction of a 

25 "nucleic acid sequence that encodes a nonnative amino 
acid sequence") . Such a nonnative amino acid sequence 
either is introduced in place of adenoviral sequences, or 
in addition to adenoviral sequences. Regardless of the 
nature of the introduction, its integration into an 

30 adenoviral fiber protein' at the level of either DMA or 
protein, results in the generation of a peptide motif 
(i.e., a peptide binding motif) in the resultant chimeric 
fiber protein. 

The peptide motif allows for cell targeting, for 

35 instance, by comprising an epitope for an antibody, or a 
ligand for a cell surface binding site. The peptide 
motif optionally can comprise other elements of use in 
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cell targeting (e.g., a single-chain antibody sequence). 
The peptide binding motif may be generated by the 
insertion, and may comprise, for instance, native and 
nonnative sequences, or may be entirely made up of 
5 nonnative sequences. The peptide motif that results from 
the insertion of the nonnative amino acid sequence into 
the chimeric fiber protein can be either a high affinity 
peptide (i.e., one that binds its cognate binding site 
when provided at a relatively low concentration) or a low 
10 affinity peptide (i.e., one that binds its cognate 
binding site when provided at a relatively high 
concentration). Preferably, however, the resultant 
peptide motif is a high affinity motif, particularly one 
that has become of high affinity for its cognate binding 
L5 site due to its constraint within the adenovirus fiber 
protein . 

An "antibody" includes, but is not limited to, 
immunoglobulin molecules and immunologically active 
portions of immunoglobulin molecules such as portions 

10 containing a paratope (i.e., an antigen binding site). 

In particular, an antibody preferably can be a bispecific 
antibody, i.e., having one paratope directed to an 
epitope of the chimeric fiber protein, and another 
paratope directed to an epitope of a cell surface binding 

5 site. 

A "cell surface binding site" encompasses a receptor 
(which preferably is a protein, carbohydrate, 
glycoprotein, or proteoglycan) as well as any oppositely 
charged molecule (i.e., oppositely charged with respect 

3 to the chimeric coat protein) or other type of molecule 
with which the chimeric coat protein can interact to bind 
the cell, and thereby promote cell entry. Examples of 
potential cell surface binding sites include, but are not 
limited to: heparin and chondroitin sulfate moieties 

> found on glycosaminoglycans; sialic acid moieties found 
on mucins, glycoproteins, and gangliosides; major 
histocompatibility complex I (MHC I) glycoproteins; 
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coiranon carbohydrate components found in membrane 
glycoproteins, including mannose, N-acetyl-galactosamine, 
N-acetyl-glucosamine, fucose, galactose, and the like. 
However, a. chimeric fiber protein according to the 
5 invention, and methods of use thereof, is not limited to 
any particular mechanism of cellular interaction (i.e., 
interaction with a particular cell surface binding site) 
and is not to be so construed, 

A cell surface binding site according to the 

10 invention preferably is one that previously was 

inaccessible to interaction with a wild-type adenoviral 
fiber protein, or was accessible only at a very low 
level, as reflected by the reduced efficiency of entry of 
a wild-type adenoviral fiber protein-containing vector as 

15 compared with a vector comprising a chimeric adenovirus 
fiber protein according to, the invention. The insertion 
of the nonnative amino acid sequence in the chimeric 
fiber protein thus desirably imparts upon the chimeric 
fiber protein an- ability to bind to a binding site 

20 present on a cell surface which wild-type fiber protein 
does not bind, or binds with very low affinity. This 
preferably results in a situation wherein the chimeric 
adenovirus fiber protein is able to direct entry into 
cells of a vector via the interaction of the nonnative 

25 amino acid sequence, either directly or indirectly, with 
a cellular receptor other than the fiber receptor. 

This also preferably results in a situation wherein 
the chimeric adenovirus fiber protein is able to direct 
entry into cells of a vector comprising the chimeric 

30 adenovirus fiber that is more efficient than entry into 
cells of a vector that is identical except for comprising 
a wild-type adenovirus fiber protein rather than the - 
chimeric adenovirus protein. Also preferably, the 
chimeric adenovirus fiber protein may act to increase the 

35 specificity of targeting, e.g., by changing the 
specificity of the fiber protein. 
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"Efficiency of entry" can be quantitated by several 
means. In particular, efficiency of entry can be 
quantitated by introducing a chimeric fiber protein into 
a vector, preferably a viral vector, and monitoring cell 
5 entry (e.g., by vector-mediated delivery to a cell of a 
gene such as a reporter gene) as a function of 
multiplicity of infection (MOI). m this case, a reduced 
MOI required for cell entry of a vector comprising a 
chimeric adenoviral fiber protein as compared with a 
10 vector that is identical, except for comprising a wild- 
type adenoviral fiber protein rather than said chimeric 
adenovirus fiber protein, indicates "more efficient- 
entry. 

Similarly, efficiency of entry can be quantitated in 
15 terms of the ability of vectors containing chimeric or. 
wild-type fiber proteins, or the soluble chimeric or 
wild-type fiber proteins themselves, to bind to cells. 
In this case, increased binding exhibited for the vector 
containing a chimeric adenoviral fiber protein, or the 
!0 chimeric fiber protein itself, as compared with the 
identical vector containing a wild-type fiber protein 
instead, or the wild-type fiber protein itself, is 
indicative of an increased efficiency of entry, or "more 
efficient" entry. 
5 According to this invention, a nonnative amino acid 

sequence is. conformationally-restrained, or 
"constrained". A nonnative amino acid sequence is 
constrained when it is present in a chimeric fiber 
protein and is presented to a cell in such a fashion that 
0 the ability of the chimeric fiber protein to bind to the 
cell and/or mediate cell entry is increased, e.g., 
relative to the wild-type protein. Such constraint 
according to the present invention can be achieved by the 
placement of a nonnative amino acid sequence in an 
5 exposed loop of the chimeric fiber protein, or, through 
the placement of the sequence in another location and 
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creation of a loop-like structure comprising the 
nonnative amino acid sequence at that site. 

Adenoviral-mediated gene delivery to specific 
tissues (i.e., ceil targeting) has been impeded by the 
5 fact that, generally, lower affinity, unconstrained 
peptides often are not as effective in mediating 
adenovirus binding to target receptors as are constrained 
peptides. For instance, peptide motifs identified by 
phage display or identified in generally are presented in 

10 a constrained environment. Accordingly, the present 
application provides a means of targeting adenovirus 
wherein, in one embodiment, the peptide motifs are 
presented in the constrained environment of the loop 
domains of the knob of the adenovirus fiber protein. 

15. This method is advantageous since not all the 

residues of the exposed fiber knob loops are critical for 
the assembly or functioning of the fiber protein, and 
thus provide convenient sites at which the peptide motifs 
can be inserted. This method further is advantageous in 

20 that additions within a loop of a protein structure will 
be more resistant to proteolytic degradation than will 
additions in the end of a protein. Moreover, for low 
affinity peptide motifs in particular, this method is 
more efficient than the method wherein the peptide motifs 

25 are presented as unconistrained linear structures at the 
C-terminus of the knob of the fiber. Conceivably, 
"constraint", according to the invention, increases 
affinity since it puts the molecule in a topological 
conformation in which it is in sync with its receptor, 

30 and, in this fashion, facilitates binding. However, the 
specification is not limited to any particular mechanism 
of action and is not to be so construed. 

In terms of the loop domains of the fiber knob which 
can be employed in the context of the invention, the 

35 crystal structure of the fiber knob has been described 
(see, e.g., Xia et al., supra , particularly Figure 4). 
The knob monomer comprises an eight-stranded antiparallel 
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P-sandwich fold. The overall structure of the fiber knob 
trimer resembles a three-bladed propeller with certain p- 
strands of each of the three monomers comprising the 
faces of the blades. In particular, the following 
5 residues of the Ad5 fiber knob appear important in 

hydrogen bonding in the p-sandwich motif: 400-402, 419- 
428, 431-440, 454-461, 479-482, 485-486, 516-521, 529- 
536, 550-557, and 573-578. The remaining residues of the 
protein (which do not appear to be critical in forming 
10 the fiber protein secondary structure) define the exposed 
loops of the protein knob domain. In particular, 
residues inclusive of 403-418 comprise the AB loop, 
residues inclusive of 441-453 comprise the CD loop, 
residues inclusive of 487-514 comprise the DG loop, 
15 residues inclusive of 522-528 comprise the GH loop, 

residues inclusive of 537-549 comprise the HI loop and 
residues inclusive of 558-572 comprise the IJ loop. 

According to this invention, "loop" is meant in the 
generic sense of defining a span of amino acid residues 
20 (i.e., more than one, preferably less than two hundred, 
and even more preferably, less than thirty) that can be 
substituted by the nonnative amino acid sequence to 
comprise a peptide motif that allows for cell targeting. 
While such loops are defined herein with respect to the 
25 Ad5 sequence, the sequence alignment of other fiber 
species have been described (see, e.g., Xia et al . , 
supra ) . For these other species (particularly Ad2, Ad3, 
Ad7, Ad40 and Ad41 described in Xia et ai., supra ) , the 
corresponding loop regions of the knob domains appear to 
30 be comparable. 

Furthermore, the corresponding residues important in 
the fiber knob for protein binding/folding appear to be 
conserved between fiber proteins of different adenoviral 
serotypes (Xia et al., supra ) . This suggests that even 
35 for those adenoviral species in which the crystal 

structure of the fiber protein is not known, outside of 
these conserved residues will lie nonconserved regions. 
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or regions that do not exhibit the. high level of 
conservation observed for the residues critical to 
protein functionality. Likely the sequence of the fiber 
knob protein in these nonconserved regions will be 
5 present as a loop due to the absence of important 
intramolecular interactions in this region of the 
protein. The loop sequences comprising these 
nonconserved regions similarly can be mutated as 
described herein by incorporation of peptide motifs 

10 allowing cell targeting. These so-called non-conserved 
sequences likely include any amino acids that occur 
outside of the conserved regions (i.e., residues 
noninclusive of those corresponding to Ad5 residues 400- 
402, 419-428, 431-440, 454-461, 479-482, 485-486, 516- 

15 521, 529-536, 550-557, and 573-578). 

More generally, the nonconserved regions will 
comprise hydrophobic residues that typically are found on 
the interior of a protein. Such hydrophobic residues 
include, but are not limited to. He, Val, Leu, Trp, Cys, 

20 and Phe . In contrast, the conserved regions generally 
will comprise hydrophilic residues such as charged 
residues (e.g., Arg, Lys, Glu, Asp, and the like} or 
polar residues or residues comprising a hydroxyl group 
(e.g., Thr, Ser, Asn, Gin, etc.). This means that a 

25 rough approximation of the exposed and buried amino acids 
of the fiber protein can be derived based on its 
hydrophobicity/hydrophilicity plot. 

Thus, the present invention preferably provides a 
chimeric adenovirus fiber protein comprising a 

30 constrained nonnative amino acid sequence. Preferably, 
the nonnative amino acid sequence is constrained by its 
presence in a loop of the knob of the chimeric fiber 
protein. In particular, desirably the nonnative amino 
acid sequence is inserted into or in place of a protein 

35 sequence in a loop of the knob of the chimeric adenoviral 
fiber protein. Optionally, the fiber protein loop is 
selected from the group consisting of the,,AB, CD, DC, GH, 


and IJ loops, and desirably is the HI loop. Also, 
preferably, the loop comprises amino acid residues in the 
fiber knob other than Ad5 residues 400-402, 419-428, 431- 
440, 454-461, 479-482, 485-486, 516-521, 529-536, 550- 
5 557, and 573-578. Desirably, the loop comprises amino 
acid residues selected from the group consisting of 
residues 403-418, 441-453, 487-514, 522-528, 537-549, and 
558-572. 

In particular, preferably the nonnative amino acid 

10 sequence present in the loop comprises a sequence 

selected from the group consisting of SEQ ID N0:1, SEQ ID 
N0:2, SEQ ID N0:3, SEQ ID NO : 4 , SEQ ID N0:5, SEQ ID 
N0:17, SEQ ID N0:19, SEQ ID NO:23, SEQ ID N0:31, SEQ ID 
NO: 35, SEQ ID NO: 39, SEQ ID NO: 43, SEQ ID NO: 4 9, SEQ ID 

15 NO:53, SEQ ID NO:56, SEQ ID NO:59, and SEQ ID NO:63, SEQ 
ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:79, and 
wherein the sequence may be deleted at either the C- or 
N-terminus by 1, 2, or 3 residues. The nonnative amino 
acid sequence also desirably can comprise conservative 

20 amino acid substitutions of these sequences, as further 
described herein. Optionally, these sequences can be 
present in the chimeric protein as depicted, for 
instance, in Figtire 4, Figure 5, Figure 6, Figure 8, 
Figure 9, and Figure 10. 

25 The invention also provides a means of targeting 

adenovirus wherein the peptide motifs are presented in a 
constrained environment at the C-terminus of the fiber 
protein in the region of the fiber knob. This method 
entails the generation of loops (i.e., "nonpreexisting 

30 loops") by bonding between cysteine residues or through 
use of other sequences capable of forming loops (e.g., a 
P-sheet), thereby creating a loop-like secondary 
structure in the domain of the protein in which the 
peptide motif is inserted. Generally, according to the 

35 invention, the nonnative amino acid sequence being added 
itself will form a loop-like structure (e.g., through 
disulfide bonding between cysteine residues occurring in 
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vivo) . However, it also is possible that the loop may 
form due to bonding, e.g., between a cysteine residue 
present in the nonnative amino acid sequence, and one in 
the wild-type fiber protein. In this sense, the looping 
of the sequence is not inherent, but is potential. 

In particular, a chimeric adenovirus fiber protein 
according to the invention comprises a nonnative amino 
acid sequence that is constrained, preferably by its 
possession of an RGD peptide (or other similar peptide 
such as LDV, as described herein) and one or more 
cysteine pairs. According to this invention, a "pair" 
comprises two cysteines separated by at least one 
intervening amino acid. Desirably, when the sequence 
comprises only a single pair, the cysteines are separated 
by the RGD sequence (or other similar sequence that can 
be employed to effect cell targeting, and preferably, is 
less than 30 amino acids) such that the nonpreexisting 
loop can.be created, i.e., through disulfide bonding. 
Preferably, the cysteine residues in this case are 
separated by less than 30 amino acids, for instance, a 
mixture of glycine and serine residues as in [SEQ ID 
NO: 72] . Regardless of the nonnative amino acid sequence 
employed, it must comprise a loop-like secondary 
structure. 

In terms 'of this nonpreexisting loop, one potential 
peptide motif and variations thereof have been described 
herein. However, other RGD-containing cyclic peptides 
have been described in the literature and can be employed 
in the context of the invention as the nonnative amino 
acid sequence (see, e.g., Koivunen et al,, 
Bio/Technology , 13, 265-270 (1995)). In particular, 
another nonnative amino acid sequence according to the 
invention can comprise the sequence CDCRGDCFC (i.e., Cys 
Asp Cys Arg Gly Asp Cys Phe Cys [SEQ ID N0:3]) . The 
nonnative amino acid sequence, however, preferably 
comprises Cys Xaa Cys Arg Gly Asp Cys Xaa Cys (SEQ ID 
NO:4] (wherein "Xaa" is any nucleic acid) or Cys(Xaa)A Cys 
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Arg Gly Asp Cys(Xaa)B Cys (SEQ ID NO: 5], wherein "A" and 
"B" can vary independently and can be any number from 0 
to 8, so long as either A or B is 1. In particular, the 
nonnative amino acid sequence preferably comprises the 
5 sequence Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Arg Gly 
Asp Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys [SEQ ID 
NO: 5], wherein deletions can be made of amino acid 
residues other than cysteine on either one or both 
side(s) of the RGD (i.e., Arg Gly Asp) sequence of 1, 2, 
3 3, 4, 5, 6, 7, or 8 residues. 

Thus, desirably the nonnative amino acid sequence 
comprising the nonpreexisting loop is inserted into or in 
place of a protein sequence at the C-terminus of the 
chimeric adenovirus fiber protein. Preferably the 
' nonnative amino acid sequence comprising the 

nonpreexisting loop is inserted into a loop of the knob 
of the chimeric adenoviral fiber protein. Optimally the 
nonnative amino acid sequence comprises a sequence 
selected from the group consisting of SEQ ID NO: 3, SEQ ID 
NO: 4 and SEQ ID NO: 5, wherein the sequence may be deleted 
at either the C- or N- terminus by 1, 2, or 3 residues. 
The amino acid sequence also desirably can comprise 
conservative amino acid substitutes of these sequences, 
as further described herein. 

The non-preexisting loop optionally is attached to 
the C-terminus of the fiber protein or in a fiber knob 
loop by means of a so-called "spacer" sequence. The 
spacer sequence may comprise part of the nonnative amino 
acid sequence proper, or it may be an entirely separate 
sequence. In particular, a spacer sequence is a sequence 
that preferably intervenes between the native protein 
sequence and the nonnative sequence, between a nonnative 
sequence and another nonnative sequence, or between a 
native sequence and another native sequence. Such a 
sequence desirably is incorporated into the protein to 
ensure that the nonnative sequence comprising the epitope 
for an antibody or cell surface binding site projects 
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from the three dimensional structure of the chimeric 
fiber in such a fashion so as to be able to interact with 
and bind to cells. A spacer sequence can be of any 
suitable length, preferably from about 3 to about 30 
amino acids, and comprises any amino acids, for instance, 
a mixture of glycine and serine residues as in [SEQ ID 
NO:72) . Optimally, the spacer sequence does not 
interfere with the functioning of the fiber protein. 


10 Nucleic Acid Encoding a Chimeric Aden ovirus Fiber Protein 
As indicated previously, preferably the nonnative 
amino acid sequence is introduced at the level of DNA. 
Accordingly, the invention also provides an isolated and 
purified nucleic acid encoding a. chimeric adenovirus 

15 fiber protein comprising a constrained nonnative amino 

acid sequence according to the invention. Desirably, the 
nucleic acid sequence that encodes the nonnative amino 
acid sequence comprises a sequence selected from the 
group consisting of SEQ ID NO: 16, SEQ ID N0:18, SEQ ID 

20 NO:22, SEQ ID NO:26, SEQ ID NO:28, SEQ ID N0:30, SEQ ID 
NO:34, SEQ ID NO:38, SEQ ID NO:42, SEQ ID NO:48, SEQ ID 
NO:52, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, and SEQ 
ID NO: 62, as well as conservatively modified variants of 
these nucleic acid sequences.. 

25 A "conservatively modified variant" is a variation 

on the nucleic acid sequence that results in a 
conservative amino acid substitution. A "conservative 
amino acid substitution" is an amino acid substituted by 
an alternative amino acid of similar charge density, 

30 hydrophilicity/hydrophobicity, size, and/or configuration 
(e.g., Val for lie). In comparison, a "nonconservatively 
modifieid variant" is a variation on the nucleic acid 
sequence that results in a nonconservative amino acid 
substitution. A "nonconservative amino acid 

35 substitution" is an amino acid substituted by an 

alternative amino acid of differing charge density, 
hydrophilicity/hydrophobicity, size, and/or configuration 
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(e.g., Val for Phe) . The means of making such 
modifications are well known in the art, are described in 
the Examples which follow, and also can be accomplished 
by means of commercially available kits and. vectors 
) (e.g.. New England Biolabs, Inc., Beverly, MA; Clontech, 
Palo Alto, CA) . Moreover, the means of assessing such 
substitutions (e.g., in terms of effect on ability to 
bind and enter cells) are described in the Examples 
herein. Other approaches described in the art also are 
' available for identifying peptide sequences that can act 
as ligands for a cell surface receptor and, hence, are of 
use in the present invention (see, e.g., Russell, Nature 
Medicine . 2, 276-277 (1996)). 

The means of making such a chimeric fiber protein, 
particularly the means of introducing the sequence at the 
level of DNA, is well known in the art, and is described 
in the Examples that follow. Briefly, the method 
comprises introducing a sequence into the sequence 
encoding the fiber protein so as to insert a new peptide 
motif into or in place of a protein sequence at the C- 
terminus of the wild-type fiber protein, or in a loop of 
a knob of the wild-type fiber protein. Such introduction 
can result in the insertion of a new peptide binding 
motif, or creation of a peptide motif (e.g., wherein some 
of the sequence comprising the motif is already present 
in the native fiber protein) . The method also can be 
carried out to replace fiber sequences with a nonnative 
amino acid sequence according to the invention. 

Generally, this can be accomplished by cloning the 
nucleic acid sequence encoding the chimeric fiber protein 
into a plasraid or some other vector for ease of 
manipulation of the sequence. Then, a unique restriction 
site at which further sequences can be added into the 
fiber protein is identified or inserted into the fiber 
sequence. A double-stranded synthetic oligonucleotide 
generally is created from overlapping synthetic single- 
stranded sense and antisense oligonucleotides such that 
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the double-stranded oligonucleotide incorporates the 
restriction sites flanking the target sequence and, for 
instance, can be used to incorporate replacement DNA. 
The plasmid or other vector is cleaved with the 
5 restriction enzyme, and the oligonucleotide sequence 
having compatible cohesive ends is ligated into the 
plasmid or other vector to replace the wild-type DNA. 
Other means of in vitro site-directed mutagenesis such as 
are known to those skilled in the art, and can be 

10 accomplished {in. particular, using PGR) , for instance, by 
means of commercially available kits, can also be used to 
introduce the mutated sequence into the fiber protein 
coding sequence. 

Once the mutated sequence is introduced into the 

15 chimeric coat protein, the nucleic acid fragment encoding 
the sequence can be isolated, e.g., by PGR amplification 
using 5* and 3' primers, preferably ones that terminate 
in further unique restriction sites. Use of primers in 
this fashion results in an amplified chimeric fiber- 

20 containing fragment that is flanked by the unique 

restriction sites. The unique restriction sites can be 
used for further convenient subcloning of the fragment. 
Other means of generating a chimeric fiber protein also 
can be employed. These methods are highly familiar to 

25 those skilled in the art. 

Vector Comprising a Chimeric Adenovirus Fiber Protein 

A "vector" according to the invention is a vehicle 

for gene transfer as that term is understood by those 
30 skilled- in the art. Three types of vectors encompassed 

by the invention are: plasmids, phages, and viruses. 

Plasmids, phages, and viruses can be transferred to a 

cell in their nucleic acid form (e.g., via transf ection) . 

In comparison, phages and viruses also can be transferred 
35 with the nucleic acid in a "capsular" form. Hence, the 

vectors (e.g., capsular form) that can be employed for 

gene transfer are referred to herein generally as 


^^^^^ PCT/US97/,4719 

22 

"vectors", with nucleic acid forms being referred to more 
particularly as "transfer vectors". However, transfer 
vectors also are vectors within the context of the 
invention. 

3 Preferably, a vector according to the invention is a 

virus, especially a virus selected from the group 
consisting of nonenveloped viruses, i.e., nonenveloped 
RNA or DNA viruses. Also, a virus can be selected from 
the group consisting of enveloped viruses, i.e., 

I enveloped RNA or DNA viruses. Such viruses preferably 
comprise a fiber protein, or an analogous coat protein 
that is used for cell entry. Desirably,, the viral coat 
protein is one that projects outward from the capsid such 
that it is able to interact with cells. In the case of 
enveloped RNA or DNA viruses, preferably che coat protein 
is a lipid envelope glycoprotein (i.e., a so-called spike 
or peplomer) . 

In particular, preferably a vector is a nonenveloped 
virus {i.e., either a RNA or DNA virus) from the family 
Hepadnaviridae, Parvoviridae, Papovaviridae, 
Adenoviridae, or Picornaviridae . A preferred 
nonenveloped virus according to the invention is a virus 
of the family Hepadnaviridae, especially of the genus 
Hepad/3a virus. A virus of the family Parvoviridae 
desirably is of the genus Parvovirus (e.g., parvoviruses 
of mammals and birds) or Dependovirus (e.g., adeno- 
associated viruses (AAVs) ) . A virus of the family 
Papovaviridae preferably is of the subfamily 
Papillomavirinae (e.g., the papillomaviruses including, 
but not limited to, human papillomaviruses (HPV) 1-4 8) or 
the subfamily Polyomavirznae (e.g., the polyomaviruses 
including, but not limited to, JC, SV40 and BK virus) . A 
virus of the family Adenoviridae desirably is of the 
genus Mastadenovirus (e.g., mammalian adenoviruses) or 
Aviadenovirus (e.g., avian adenoviruses). A virus of the 
family Picornaviridae is preferably a hepatitis A virus 
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(HAV), hepatitis B virus (HBV) , or a non-A or non-B 
hepatitis virus. 

Similarly, a vector can be an enveloped virus from 
the family Herpesviridae or Retroviridae, or can be a 
5 Sindbis virus. A preferred enveloped virus according to 
the invention is a virus of the family Herpesviridae, 
especially of the subfamily or genus Alptiaherpesvirinae 
(e.g., herpes simplex-like viruses), Simplexvirus (e.g., 
herpes simplex-like viruses), Varicellavirus (e.g., 

10 varicella and pseudorabies-like viruses), 

Betaherpesvirinae (e.g., the cytomegaloviruses) , 
Cytomegalovirus (e.g., the human cytomegaloviruses), 
Gammaherpesvirinae (e.g., the lymphocyte-associated 
viruses), and Lymphocryptovirus (e.g., EB-like viruses). 

15 Another preferred enveloped virus is a RNA virus of 

the family i?etroviridae (i.e., a retrovirus), 
.particularly a virus of the genus, or subfamily 
Oncovirinae, Spumavirinae, Spumavirus, Lantivirinae, or 
Lentivirus. A RNA virus of the subfamily Oncovirinae is 

20 desirably a human T-lymphotropic virus type 1 or 2 (i.e., 
HTLV-1 or HTLV-2) or bovine leukemia virus (BLV) , an 
avian leukosis-sarcoma virus (e.g., Rous "sarcoma virus 
(RSV), avian myeloblastosis virus (AMV) , avian 
erythroblastosis virus (AEV) , Rous-associated virus 

25 (RAV)-l to 50, RAV-0) , a mammalian C-type virus (e.g., 
Moloney murine leukemia virus (MuLV) , Harvey murine 
sarcoma virus (HaMSV) , Abelson murine leukemia virus (A- 
MuLV) , AKR-MuLV, feline leukemia virus (FeLV) , simian 
sarcoma virus, reticuloendotheliosis virus (REV) , spleen 

30 necrosis virus (SNV) ) , a B-type virus (e.g., mouse 

mammary tumor virus (MMTV) ) , or a D-type virus (e.g., 
Mason-Pfizer monkey virus (MPMV) , "SAIDS" viruses). A 
RNA virus of the subfamily Lentivirus is desirably a 
human immunodeficiency virus type 1 or 2 (i.e., HIV-1 or 

35 HIV-2, wherein HIV-1 was formerly called lymphadenopathy 
associated virus 3 (HTLV-III) and acquired immune 
deficiency syndrome (AIDS ) -related virus (ARV) ) , or 
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another virus related to HIV-1 or HIV-2 that has been 
identified and associated with AIDS or AIDS-like disease 
The acronym "HIV" or terms "AIDS virus" or "human 
immunodeficiency virus" are used herein to refer to these 
5 HIV viruses, and HIV-related and -associated viruses, 
generically. Moreover, a RNA virus of the subfamily 
Lentivirus preferably is a Visna/maedi virus (e.g., such 
as infect sheep), a feline immunodeficiency virus (FIV) , 
bovine lentivirus, simian immunodeficiency virus (SIV) , 
10 an equine infectious anemia virus (EIAV), or a caprine' 
arthritis-encephalitis virus (CAEV) . 

An especially preferred vector according to the 
invention is an adenoviral vector (i.e., a viral vector 
of the family Adenoviridae, optimally of the genus 
.5 Mastadenovirus) . Desirably such a vector is an Ad2 or 
Ad5 vector, although other serotype adenoviral vectors 
can be employed. Adenoviral stocks that can be employed 
according to the invention include any of the adenovirus 
serotypes 1 through 47 currently available from American 
0 Type Culture Collection (ATCC, Rockville, MD) , or from 
any other serotype of adenovirus available from any other 
source. For instance, an adenovirus can be of subgroup A 
(e.g., serotypes 12, 18, 31), subgroups (e.g., serotypes 
3, 7, II, 14, 21, 34, 35), subgroup C (e.g., 

5 serotypes 1, 2, 5, 6), subgroup D (e.g., serotypes 8 9 
10, 13, 15, 17, 19, 20, 22-30, 32, 33, 36-39, 42-47), 
subgroup E (serotype 4), subgroup F (serotype 40, 4l!, or 
any other adenoviral serotype. 

The adenoviral vector employed for gene transfer can 
) be wild-type (i.e., replication competent) . Alternately, 
the adenoviral vector can comprise genetic material with' 
at least one modification therein, which can render the 
virus replication deficient. The modification to the 
adenoviral genome can include, but is not limited to, 
' addition of a DNA segment, rearrangement of a DNA 

segment, deletion of a DNA segment, replacement of a DNA 
segment, or introduction of a DNA lesion. A DNA segment 


wo 98/07865 PCT/US97/14719 

25 

can be as small as one nucleotide and as large as 36 
kilobase pairs (i.e., the approximate size of the 
adenoviral genome) or, alternately, can equal the maximum 
amount which can be packaged into an adenoviral virion 
5 (i.e., about 38 kb) . Preferred modifications to the 

adenoviral genome include modifications in the El, E2, E3 
and/or E4 region. An adenoviral vector also preferably 
can be a cointegrated, i.e., a ligation of adenoviral 
genomic sequences with other sequences, such as other 

10 virus, phage, or plasmid sequences. 

In terms of a viral vector (e.g., particularly a 
replication deficient adenoviral vector) , such a vector 
can comprise either complete capsids (i.e., including a 
viral genome such as an adenoviral genome) or empty 

15 capsids (i.e., in which a viral genome is lacking, or is 
degraded, e.g., by physical or chemical means). 
Preferably the viral vector comprises complete capsides. 
Along the same lines, since methods are available for 
transferring viruses, plasraids, and phages in the form of 

20 their nucleic acid sequences (i.e., RNA or DNA) , a vector 
(i.e., a transfer vector) similarly can comprise RNA or 
DNA, in the absence of any associated protein such as 
capsid protein, and in the absence of any envelope lipid. 
Thus, according to the invention whereas a vector 

25 "comprises" a chimeric adenoviral fiber protein, a 

transfer vector comprises a chimeric adenoviral fiber 
protein in the sense, that it "encodes" the chimeric 
adenoviral fiber protein. 

A vector according to the invention can comprise 

30 additional sequences and mutations, e.g., some within the 
fiber protein itself. For instance, a vector according 
to the invention further preferably comprises a nucleic 
acid comprising a passenger gene. 

A "nucleic acid" is a polynucleotide (DNA or RNA) . 

35 A "gene" is any nucleic acid sequence coding for a 

protein or a nascent RNA molecule. A "passenger gene" is 
any gene which is not typically present in and is 
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subcloned into a vector (e.g., a transfe_ ^, 

according to the present invention, and which upon 
introduction into a host cell is accompanied by a 
discernible change in the intracellular environment 
5 (e.g., by an increased level of deoxyribonucleic acid 

(DNA), ribonucleic acid (RNA), peptide or protein, or by 
an altered rate of production or degradation thereof) . A 
"gene product" is either an as yet untranslated RNA 
molecule transcribed from a given gene or coding sequence 

0 (e.g., mRNA or antisense RNA) or the polypeptide chain 
(i.e., protein or peptide) translated from the mRNA 
molecule transcribed from the given gene or coding 
sequence. Whereas a gene comprises coding sequences plus 
any non-coding sequences, a "coding sequence" does not 

5 include any non-coding (e.g., regulatory) DNA. A gene or 
coding sequence is "recombinant" if the sequence of bases 
along the molecule has been altered from the sequence in 
which the gene or coding sequence is typically found in 
nature, or if the sequence of bases is not typically 

1 found in nature. According to this invention, a gene or 
coding sequence can be wholly or partially synthetically 
made, can comprise genomic or complementary DNA (cDNA) 
sequences, and can be provided in the form of either DNA 
or RNA. 

Non-coding sequences or regulatory sequences include 
promoter sequences. A "promoter" is a DNA sequence that 
directs the binding of RNA polymerase and thereby 
promotes RNA synthesis. "Enhancers" are cis-acting 
elements of DNA that stimulate or inhibit transcription 
of adjacent genes. An enhancer that inhibits 
transcription is also termed a "silencer". Enhancers 
differ from DNA-binding sites for sequence-specific DNA 
binding proteins found only in the promoter (which are 
also termed "promoter elements") in that enhancers can 
function in either orientation, and over distances of up 
to several kilobase pairs, even from a position 
downstream of a transcribed region. According to the 
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invention, a coding sequence is "operably linked" to a 
promoter (e.g., when both the coding sequence and the 
promoter constitute a passenger gene) when the promoter 
is capable of directing transcription of that coding 
5 sequence. 

Accordi-ngly, a "passenger gene" can be any gene, and 
desirably is either a therapeutic gene or a reporter 
gene. Preferably a passenger gene is capable of being 
expressed in a cell in which the vector has been 

10 internalized. For instance, the passenger gene can 
comprise a reporter gene, or a nucleic acid sequence 
which encodes a protein that can in some fashion be 
detected in a cell. The passenger gene also can comprise 
a therapeutic gene, for instance, a therapeutic gene 

15 which exerts its effect at the level of RNA or protein. 
For instance, a protein encoded by a transferred 
therapeutic gene can be employed in- the treatment of an 
inherited disease, such as, e.g., the cystic fibrosis 
transmembrane conductance regulator cDNA for the 

20 treatment of cystic fibrosis. The protein encoded by the 
therapeutic gene may exert its therapeutic effect by 
resulting in cell killing. For instance, expression of 
the gene in itself may lead to cell killing, as with 
expression of the- diphtheria toxin A gene, or the 

25 expression of the gene may render cells selectively 

sensitive to the killing action of certain drugs, e.g., 
expression of the HSV thymidine kinase gene renders cells 
sensitive to antiviral compounds including acyclovir, 
gancyclovir and FIAU ( 1- (2-deoxy-2-f luoro-p-D- 

30 arabinofuranosil) -5-iodouracil) . 

Moreover, the therapeutic gene can exert its effect 
at the level of RNA, for instance, by encoding an 
antisense message or ribozyme, a protein which affects 
splicing or 3' processing (e.g., polyadenyiation) , or can 

35 encode a protein which acts by affecting the level of 
expression of another gene within the cell (i.e., where 
gene expression is broadly considered to include all 
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steps from initiation of cranscripcion through production 
of a processed protein) , perhaps, among other things, by 
mediating an altered rate of mRNA accumulation, an 
alteration of mRNA transport, and/or a change in post- 
5 transcriptional regulation. Accordingly, the use of the 
term "therapeutic gene" is intended to encompass these 
and any other embodiments of that which is more commonly 
referred to as gene therapy as known to those of skill in 
the art. Similarly, the recombinant adenovirus can be 
10 used for gene therapy or to study the effects of 

expression of the gene in a given cell or tissue in vitro 
or in vivo. 

The present invention accordingly provides a vector 
comprising a chimeric adenovirus fiber protein that 

15 comprises a constrained nonnative amino acid sequence. 

Such a vector preferably comprises a passenger gene which 
optionally is either inserted into the adenoviral genome 
or is attached to a coat protein (i.e., penton base, 
fiber, or hexon protein) of the adenovirus by means of a 

20 protein/DNA interaction. Alternately, the adenoviral 

vector preferably carries into a cell an unlinked DNA or 
protein molecule, or other small moiety, by means of 
adenovirus bystander-mediated uptake of these molecules 
(International Patent Application WO 95/21259). 

25 Along these lines, the method of the invention can 

be employed to transfer nucleic acid sequences which are 
transported as part of the adenoviral genome (i.e., 
encoded by adenovirus) , and to transfer nucleic acid 
sequences that are attached to the outside of the 

30 adenoviral capsid (Curiel et al., supra ) , as well as 
unattached DNA, protein, or other small molecules that 
similarly can be transported by adenoviral bystander- 
mediated uptake (International Patent Application WO 
95/21259) - The method can be employed to mediate gene 

35 and/or protein delivery either ex vivo or in vivo, as 
described herein. 


wo 98/07865 PCTA;S97/14719 

29 

Desirably, a vector is a viral vector selected from 
the group consisting of nonenveloped viruses. Such a 
vector desirably comprises a nonnative amino acid 
sequence according to the invention and/or a nucleic acid 
5 sequence that encodes such nonnative amino acid sequence. 
Optimally, the vector is an adenoviral vector, 
particularly an adenoviral vector selected from the group 
consisting of AdZ.FLAG, AdZ.RKKK2, AdZ.pGS, AdZ.RGD, 
AdZ.pRGD, AdZ.pLDV, and AdZ.pYIGSR. 
10 The means of making the recombinant adenoviral 

vectors according to the invention are known to those 
skilled in the art. For instance, recombinant adenovirus 
comprising a chimeric fiber protein and the recombinant 
adenovirus that additionally comprises a passenger gene 
15 or genes capable of being expressed in a- particular cell 
can be generated by use of a transfer vector, preferably 
a viral or plasmid transfer vector, in accordance with 
the present invention. Such a transfer vector preferably 
comprises a chimeric adenoviral fiber sequence as 
20 previously described. The chimeric fiber protein gene 
sequence comprises a nonnative (i.e., non-wild-type) 
sequence in place, of the native sequence, which has been 
deleted, or in addition to the native sequence. 

A recombinant chimeric fiber protein gene sequence 
25 can be moved to or from an adenoviral vector from or into 
a baculovirus or a suitable prokaryotic or eukaryotic 
expression vector for expression and evaluation of 
receptor or protein specificity and avidity, 
trimerization potential, penton base binding, and other 
30 biochemical characteristics. In particular, the method 
of protein production in baculovirus as set forth in the 
Examples which follow, and as described in Wickham et al. 
(1995), supra , can be employed. 

Accordingly, the present invention also provides 
35 recombinant baculoviral and prokaryotic and eukaryotic 

expression vectors comprising a chimeric adenoviral fiber 
protein gene sequence, which also can be transfer 


vectors. The present invention also provides vectors 
that fall under a conunonly employed definition of 
transfer vectors, . e . g . , vectors which are piasmids 
containing adenovirus sequences that are used to create 
5 new adenovirus vectors. The chimeric fiber protein gene 
sequence includes a nonnative sequence in addition to or 
in place of a native amino acid sequence. This enables 
the resultant chimeric fiber protein to bind to a binding 
site other than a binding site bound by the native 
10 sequence. By moving the chimeric gene from an adenoviral 
transfer vector to baculovirus or a prokaryotic or 
eukaryotic expression vector, high protein expression is 
achievable (approximately 5-50% of the total protein 
being the chimeric fiber) . Preferred transfer vectors 
15 according to the invention are selected from the group ' 
consisting of pl93(F5*), pl93 F5F2K(FLAG), pl93 F5F2K, 
pl93 F5F2K(RKKK2) , pl93 ( F5 ) pGS (RGD) , pi 93 < F5 ) pLDV, 
pl93 (F5)pYIGSR, and pl93 ( F5*) RGD. 

A vector according to the invention further can 
20 comprise, either within, in place of, or outside of the 
coding sequence of a fiber protein additional sequences 
that impact upon the ability of the fiber protein to 
trimerize, or comprise a protease recognition sequence. 
A sequence that impacts upon the ability to trimerize is 
25 one or more sequences that enable fiber trimerization . A 
sequence that comprises a protease recognition sequence 
is a sequence that can be cleaved by a protease, thereby 
effecting removal of the chimeric coat protein (or a 
portion thereof) and attachment of the recombinant 
30 adenovirus to a cell by means of another coat protein. 
When employed with a fiber protein, the protease 
recognition site preferably does not affect fiber 
trimerization or receptor specificity of the fiber 
protein. For instance, in one embodiment of the present 
35 invention, preferably the fiber protein, or a portion 
thereof, is deleted by means of a protease recognition 
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sequence, and then the penton base protein, or another 

protein, comitiands cell binding/cell entry. 

In terms of the production of vectors and transfer 

vectors according to the invention, transfer vectors are 
5 constructed using standard molecular and genetic 

techniques such as are known to those skilled in the art. 

Vectors comprising virions or virus particles are 

produced using viral vectors in the appropriate cell 

lines. Similarly, the adenoviral fiber chimera- 
10 containing particles are produced in standard cell lines, 

e.g., those currently used for adenoviral vectors. 

Following production and purification, the particles in 

which fiber is to be deleted are rendered fiberless 

through digestion of the particles with an appropriate 
15 sequence-specific protease, which cleaves the fiber 

proteins "and releases them from the viral particles to 

generate fiberless particles. 

Illustrative Uses 
20 The present invention provides a chimeric fiber 

protein that is able to bind to cells and mediate entry 
into cells with high efficiency, as well as vectors and 
transfer vectors comprising same. The chimeric fiber 
protein itself has multiple uses, e.g., as a tool for 
.25 studies in vitro of adenovirus binding to cells (e.g., by 
Scatchard analysis as shown previously by Wickham et al. 
(1993), supra ) , to block binding of adenovirus to 
receptors in vitro (e.g., by using antibodies, peptides, 
and enzymes, as described in the Examples herein and as 
30 known in the art), and, with use of some chimeric fiber 
proteins comprising particular peptide motifs, to protect 
against adenoviral infection in vivo by competing for 
binding to the binding site by which adenovirus effects 
cell entry. 

35 A vector comprising a chimeric fiber protein also 

can be used in strain generation and as a means of making 
new vectors. For instance, the nonnative amino acid 
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sequence can be introduced intraceliularly as a means of 
generating new vectors via recombination. Similarly, a 
vector can be used' in gene therapy. For instance, a 
vector of the present invention can be used to treat any 
j one of a number of diseases by delivering to targeted 
cells corrective DNA, i.e., DNA encoding a function that 
is either absent or impaired, or a discrete killing - 
agent, e.g., DNA encoding a cytotoxin that, for example, 
is active only intraceliularly. Diseases that are 
I candidates for such treatment include, for example, 

cancer, e.g., melanoma, glioma or lung cancers; genetic 
disorders, e.g., cystic fibrosis, hemophilia or muscular 
dystrophy; pathogenic infections, e.g., human 
immunodeficiency virus, tuberculosis or hepatitis; heart 
disease, e.g., preventing restenosis following 
angioplasty or promoting angiogenesis to reperfuse 
necrotic tissue; and autoimmune disorders, e.g., Crohn's 
disease, colitis or rheumatoid arthritis. 

In particular, gene therapy can be carried out in 
the treatment of diseases, disorders, or conditions 
associated with different tissues that, prior to the 
present invention, adenovirus was not able to bind to and 
enter, or could do so only with low affinity and/or 
specificity. For instance, the method can be employed to 
incorporate a targeting sequence which permits an 
increased efficiency of gene delivery to different 
tissues. Such targeting sequences include, but are not 
limited to: a heparin binding domain (e.g., polyK, 
polyR, or combinations thereof) ; an integrin binding 
domain {e.g., RGD, LDV, and the like); a laminin receptor 
domain (e.g., YIGSR [SEQ ID NO: 66]); a DNA binding domain 
(e.g., polyK, polyR, or combinations thereof); antibody 
epitopes (e.g., the FLAG peptide DYKDDDDK [SEQ ID N0:2] 
or other epitope); a brain-specific targeting domain 
(e.g., SLR); and any other peptide domain which binds to 
a receptor (e.g., in particular, a peptide domain ranging 
from about 2 to 200 amino acids) . 
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Along these lines, the method can be employed to 
increase the efficiency of adenoviral-mediated delivery 
to, for instance, bone marrow cells, endothelium, organs 
such as lung, liver, spleen, kidneys, brain, eye, heart, 
5 muscle, and the like, hematopoietic cells, tumor 

vasculature, and tumor cells. Diseases, disorders, or 
conditions associated with these tissues include, but are 
not limited to angiogenesis , restenosis, inflammation, 
cancers, Alzheimer's disease, human immunodeficiency 

10 virus (HIV-1, HIV-2) infection, and anemias. 

These aforementioned illustrative uses are by no 
means comprehensive, and it is intended that the present 
invention encompasses such further uses which flow from, 
but are not explicitly recited in the disclosure herein. 

15 Similarly, there are numerous advantages associated with 
the use of the various aspects of the present invention. 

For instance, with incorporation of antibody 
epitopes into the fiber protein, if the antibody epitope 
is in a loop close to the fiber receptor binding domain, 

20 then binding of the bispecific antibody will block normal 
receptor binding, thereby increasing the specificity of 
cell targeting using the antibody epitope. If the fiber 
receptor binding domain is mutated such that it no longer 
binds its receptor, then incorporation of specific 

25 receptor binding domains into the loop will allow 

targeting to those tissues that express the complementary 
receptor in the absence of any competing binding mediated 
by the wild-type fiber receptor binding domain. 

Similarly, a domain which permits inactivation of 

30 fiber for its normal receptor binding also can be 

incorporated into an exposed loop of the fiber protein. 
Inactivation of the fiber binding to its normal receptor 
will permit specific targeting via another protein or 
domain of adenovirus. For instance, ttv integrin 

35 targeting with native penton base can be accomplished in 
this fashion. Along these lines, an enterokinase 
cleavage site (e.g., DYKDDDDK [SEQ ID N0:2]) or trypsin 
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cleavage site (e.g., RKKKRKKK (SEQ ID N0:1J) can be 
incorporated into a fiber loop followed by treatment of 
adenoviral particles with enterokinase or trypsin. 
Native adenovirus particles are inmune to such 
5 enterokinase or trypsin treatment. 

Furthermore, a vector according to the invention, 
particularly an adenoviral vector, is advantageous in 
that it can be isolated and purified by conventional 
means. Since changes in the vector are made at the 
10 genome level, there are no cumbersome and costly post- 
production modifications required, as are associated with 
other vectors (see, e.g.. Gotten et al., Proc. Natl. 
Acad. Sci., 89, 6094-6098 (1992); Wagner et al . , Proc. 
Natl. Acad. Sci.. 89, 6099-6103 (1992)). Similarly, 
15 special adenoviral receptor-expressing cell lines are not 
required. An adenoviral vector comprising the chimeric 
fiber protein can be propagated to similar titers as a 
wild-type vector lacking the fiber modification. 

20 Means of Administration 

The vectors and transfer vectors of the present 
invention can be employed to contact cells either in 
vitro or in vivo. According to the invention 
"contacting" comprises any means by which a vector is 

25 introduced intracellularly; the method is not dependent 
on any particular means of introduction and is not to be 
so construed. Means of introduction are well known to 
those skilled in the art, and also are exemplified 
herein. 

30 Accordingly, introduction can be effected, for 

instance, either in vitro (e.g., in an ex vivo type 
method of gene therapy or in tissue culture studies) or 
in vivo by eiectroporation, transformation, transduction, 
conjugation or triparental mating, (co-) transf ection, 

35 (CO-) infection, membrane fusion with cationic lipids, 
high velocity bombardment with DNA-coated 
microprojectiles, incubation with calcium phosphate-DNA 
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precipitate, direct microinjection into single cells, and 
the like. .Similarly, the vectors can be introduced by 
means of cationic lipids, e.g., liposomes. Such 
liposomes are commercially available (e.g., Lipofectin®, 
5 Lipofectamine'™, and the like, supplied by Life 

Technologies, Gibco BRL, Gaithersburg, MD) . Moreover, 
liposomes having increased transfer capacity and/or 
reduced toxicity in vivo (see, e.g.. International Patent 
Application WO 95/21259) can be employed in the present 

10 invention. Other methods also are available and are 
known to those skilled in the art. 

According to the invention, a "host" (and thus a 
"cell" from a host) encompasses any host into which a 
vector of the invention can be introduced, and thus 

15 encompasses an animal, including, but not limited to, an 
amphibian, bird, fish, insect, reptile, or mammal. 
Optimally a host is a mammal, for instance, rodent, 
primate (such as chimpanzee, monkey, ape, gorilla, 
orangutan, or gibbon), feline, canine, ungulate (such as 

20 ruminant or swine), as well as, in particular, human. 

Desirably such a host cell is one in which an adenovirus 
can exist for a' period of time (i.e., typically from 
anywhere up to, and potentially even after, about two 
months) after entry into the cell. 

25 A cell can be present as a single entity, or can be 

part of, a larger collection of cells. Such a "larger 
collection of cells" can comprise, for instance, a cell 
culture (either mixed or pure), a tissue (e.g., 
epithelial or other tissue), an organ (e.g., heart, lung, 

30 liver, gallbladder, urinary bladder, eye, and other 
organs), an organ system (e.g., circulatory system, 
respiratory system, gastrointestinal system, urinary 
system, nervous system, integumentary system or other . 
organ system), or an organism (e.g., a bird, mammal, or 

35 the like) . Preferably, the peptide binding motif 
employed for cell targeting is such that the 
organs/tissues/cells being targeted are of the 
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circulatory system (e.g., including, but not limited to 
heart, blood vessels, and blood) , respiratory system 
(e.g., nose, pharynx, larynx, trachea, bronchi, 
bronchioles, lungs, and the like), gastrointestinal 
5 system (e.g., including mouth, pharynx, esophagus, 

stomach, intestines, salivary glands, pancreas, liver, 
gallbladder, and others), urinary system (e.g., such as 
Icidneys, ureters, urinary bladder, urethra, and the 
like), nervous system (e.g., including, but not limited 

10 to brain and spinal cord, and special sense organs such 
as the eye) and integumentary system (e.g., skin). Even 
more preferably, the cells being targeted are selected 
from the group consisting of heart, hematopoietic, lung, 
liver, spleen, kidney, brain, eye, bone marrow, 

15 endothelial, muscle, tumor vasculature, and tumor cells. 

One skilled in the art will appreciate that suitable 
methods of administering a vector (particularly an 
adenoviral vector) of the present invention to an animal 
for purposes of gene therapy (see, for example, Rosenfeld 

20 et al.. Science , 252 , 431-434 (1991); Jaffe et al., Clin. 
Res. , 39(2) , 302A (1991); Rosenfeld et al., Clin. Res. , 
39(2) , 311A (1991); Berkner, BioTechniques , 6, 616-629 
(1988); Crystal et al.. Human Gene Ther . , 6, 643-666 
(1995); Crystal et al.. Human Gene Ther. , 6, 667-703 

25 (1995)), chemotherapy, and vaccination are available, 
and, although more than one route can be used for 
administration, a particular route can provide a more 
immediate and more effective reaction than another route. 
Pharmaceutically acceptable excipients also are well- 

30 known to those who are skilled in the art, and are 
readily available. The choice of excipient will be 
determined in part by the particular method used to 
administer the recombinant vector. Accordingly, there is 
a wide variety of suitable formulations for use in the 

35 context of the present invention. The following methods 
and excipients are merely exemplary and are in no way 
limiting . 
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Moreover, to optimize the ability of the adenovirus 
to enter the cell by the method of the invention, 
preferably the method is carried out in the absence of 
neutralizing antibodies directed against the particular • 
adenovirus being introduced intracellularly . In the 
absence of such antibodies, there is no possibility of 
the adenovirus being bound by the antibody, and thus 
impeded from binding and/or entering the cell. It is 
well within the ordinary skill of one in the art to test 
for the presence of such neutralizing antibodies. 
Techniques that are known ,^ in the art can be employed to 
prevent the presence of neutralizing antibodies from 
impeding effective protein production (see, e.g., 
Crompton et al., supra , International Patent Application 
WO 96/12406).. 

Formulations suitable for oral administration can 
consist of (a) liquid solutions, such as an effective 
amount of the compound dissolved in diluents, such as 
water, saline, or orange juice; .(b) capsules, sachets or 
tablets, each containing a predetermined amount of the 
active ingredient, as solids or granules; (c) suspensions 
in an appropriate liquid; and (d) suitable emulsions. 
Tablet forms can include one or more of lactose, 
mannitol, corn starch, potato starch, microcrystalline 
cellulose, acacia, gelatin, colloidal silicon dioxide, 
croscarmellose sodium, talc, magnesium stearate, stearic 
acid, and other excipients, colorants, diluents, 
buffering agents, moistening agents, preservatives, 
flavoring agents, and pharmacologically compatible 
excipients. Lozenge. forms can comprise the active 
ingredient in a flavor, usually sucrose and acacia or 
tragacanth, as well as pastilles comprising the active 
ingredient in an inert base, such as gelatin and 
glycerin, or sucrose and acacia, emulsions, gels, and the 
like containing, in addition to the active ingredient, 
such excipients as are known in the art. 
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A vector or transfer vector of the present 
invention, alone or in combination with other suitable 
components, can be made into aerosol formulations to be 
administered via inhalation. These aerosol formulations 
can be placed into pressurized acceptable propellants, 
such as dichlorodifluoromethane, propane, nitrogen, and 
the like. They may also be formulated as pharmaceuticals 
for non-pressured preparations such as in a nebulizer or 
an atomizer. 

Formulations suitable for parenteral administration 
include aqueous and non-aqueous, isotonic sterile 
injection solutions, which can contain anti-oxidants, 
buffers, bacteriostats , and solutes that render the 
formulation isotonic with the blood of the intended 
recipient, and aqueous and non-aqueous sterile 
suspensions that can include suspending agents, 
solubilizers, thickening agents, stabilizers, and 
preservatives. The formulations can be presented in 
unit-dose or multi-dose sealed containers, such as 
ampules and vials, and can be stored in a freeze-dried 
(lyophilized) condition requiring only the addition of 
the sterile liquid excipient, for example, water, for 
injections, immediately prior to use. Extemporaneous 
injection solutions and suspensions can be prepared from 
sterile powders, granules, and tablets of the kind 
previously described. 

Additionally, a vector or transfer vector of the 
present invention can be made into suppositories by 
mixing with a variety of bases such as emulsifying bases 
or water-soluble bases. 

Formulations suitable for vaginal administration can 
be presented as pessaries, tampons, creams, gels, pastes, 
foams, or spray formulas containing, in addition to the 
active ingredient, such carriers as are known in the art 
to be appropriate. 

The dose administered to an animal, particularly a 
human, in the context of the present invention will vary 
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with the gene of interest, the composition employed, the 
method of administration, and the particular site and 
organism being treated. However, the dose should be 
sufficient to effect a therapeutic response. 

As previously indicated, a vector or a transfer 
vector of the present invention also has utility in 
vitro. Such a vector can be used as a research tool in 
the study of adenoviral attachment and infection of cells 
and in a method of assaying binding site-ligand 
interaction. Similarly, the chimeric fiber protein 
comprising a constrained nonnative amino acid sequence in 
addition to or in place of a native amino acid sequence 
can be used in receptor-ligand assays and as adhesion 
proteins in vitro or in vivo, for example. 

Examples 

The following examples further illustrate the 
present invention and, of course, should not be construed 
as in any way limiting its scope. 

Example 1 

This example describes the construction of transfer 
vectors encoding fiber sequences having insertions of 
various peptide motifs in exposed loops of the knob 
region of the adenovirus fiber protein. 

The fiber proteins of Ad2 and AdS both recognize the 
same receptor. A parallel evaluation of the protein 
structure of the fiber knob and its DNA restriction map 
reveals that the Ad2 fiber knob contains a unique Spe 1 
restriction site in a region encoding an exposed loop in 
the protein. The amino acids in this loop are not 
involved in any interactions relevant to protein folding. 
Accordingly, additions to this loop are highly unlikely 
to affect the ability of the fiber protein to fold. 
Chimeric adenoviral fiber proteins comprising 
modifications of an exposed loop (particularly the HI 
loop) were constructed as described herein. 
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For vector construction and characterization, 
standard molecular and genetic techniques, such as the 
generation of strains, plasmids, and viruses, gel 
electrophoresis, DNA manipulations including plasmid 
isolation, DNA cloning and sequencing. Western blot 
assays, and the like, were performed such as are known to 
those skilled in the art, and as are described in detail 
in standard laboratory manuals (e.g., Maniatis et al.. 
Molecular Cloning: A Lab oratory Manual . 2nd ed. (Cold 
Spring Harbor, NY, 1992); Ausubel et al . , Current 
Protocols in Molecular Blnlo^y (1987)). Restriction 
enzymes and other enzymes used for molecular ' 
manipulations were purchased from commercial sources 
(e.g., Boehringer Mannheim, inc., Indianapolis, Indiana- 
New England Biolabs, Beverly, Massachusetts; Bethesda 
Research Laboratories, Bethesda, Maryland), and were used 
according to the recommendations of the manufacturer. 
Cells employed for experiments (e.g., cells of the 
transformed human embryonic kidney cell line 293 (i.e., 
CRL 1573 cells) and other cells supplied by American Type 
Culture Collection) were cultured and maintained using 
standard sterile culture reagents, media and techniques, 
as previously described (Erzerum et al . , Nucleic Acids 
Research . 21, 1607-1612 (1993)). ' 

In order to make recombinant adenovirus vectors 
containing targeting sequences by ligation of restriction 
digest fragments, it was first necessary to exchange the 
knob region of the Ad5 present in a transfer vector with 
the knob coding region from Ad2, since the HI loop of Ad2 
comprises a unique Spe I restriction site, which allows 
cloning of particular targeting sequences into this site 
The net result of this vector manipulation was to create 
a fiber chimera in which the DNA encoding the tail and 
shaft of the fiber are from Ad5, the DNA encoding the 
35 knob is from Ad2, and the knob further comprises a 

nonnative amino acid sequence in the HI loop as depicted 
in Figure 1. Alternatively, standard oligonucleotide- 


15 


20 


25 


30 


WO98/07M5 PCT/US97/14719 

41 

mediated site-directed mutagenesis was used according to 
the manufacturer's directions (Stratagene, La Jolla, CA) . 
For site-directed mutagenesis, there is no need to swap 
the fiber knobs, because unique restriction sites are not 
5 required. In yet another alternative method of the 
invention described in later Examples, the targeting 
sequence is placed at the terminus of the fiber knob 
protein, as depicted in Figure 2. 

In the first step of the process of making fiber 

10 knob insertions in a loop, the transfer vector pl93(F5*) 
depicted in Figure 3 was constructed. This plasmid 
contains an 8 nucleotide insertion between the last amino 
acid codon of the fiber coding sequence and the stop 
codon. The 8 nucleotide insertion contains a unique Bam 

15 HI restriction site which allows a straightforward 

replacement of Ad5 fiber domains with other fiber domains 
from other adenovirus serotypes. Namely, the sequence of 
the wild-type AdS fiber gene is: 

TCA TAG ATT GCC CAA GAA TAA A [SEQ ID NO: 6] 

20 Ser Tyr He Ala Gin Glu * [SEQ ID N0:7] 

wherein the * indicates a termination codon. In 

comparison, the C-terminus of the mutated fiber. gene 

present in pl93(F5*) is: 

25 TCA TAG ATT GCC CAA GAA GGA TCC AAT AAA [SEQ ID NO: 8] 

Ser Tyr He Ala Gin Glu Gly Ser Asn Lys [SEQ ID NO: 9] 

wherein the underlined sequence indicates the Bam HI site 
introduced into the fiber protein- This Bam HI site also 

30 serves to code for the amino acids glycine and serine. 

The transfer plasmid pl93(F5*) was constructed from 
pl93NS(AF). The mutated fiber gene (i.e., the fiber gene 
comprising the Bam HI site prior to the stop codon) was 
incorporated into the fiber-minus plasmid pl93NS(AF) 

35 using synthetic sense and antisense oligonucleotide 
primers to amplify the fiber gene by means of the . 
polymerase chain reaction (PGR) while at the same time 
incorporating a modified Bam HI site following the last 
codon of the fiber gene to create the mutant fiber gene. 


The primers used to amplify from the Nde I site to the C- 
terminai coding regions of the fiber gene from Ad5 genome 
DNA were: antisense primer, T CCC CCC GGG TCT AGA TTA GGA 
TCC TTC TTG GGC AAT GTA TGA {Bam HI site underlined) [SEQ 
5 ID NO: 10]; sense primer CGT GTA TC C ATA TG A CAC AGA {Nde 
I site underlined) [SEQ ID N0:11]. The PGR product was 
then cut with Nde I and Bam HI and cloned into the Nde 
1/Bam HI sites of pl93NS(AF). 

The plasmid pl93NS{AF) itself was constructed by 
10 means of an intermediary series of vectors. Namely, 

first, the transfer plasmid pl93NS83-100 was constructed 
by cloning the Ad5 Nde I to Sal I fragment, which spans 
the 83-100 map unit region of the Ad5 genome containing 
the fiber gene, into the plasmid pNEB193 (New England 
15 Biolabs, Beverly, MA). The Nde 1-Mun I fragment was 
replaced with a synthetic oligonucleotide comprising a 
Bam HI site, which was flanked by a 5 ' Nde 1 site and a 
3' Mun I site to facilitate cloning. The double-stranded 
synthetic oligonucleotide fragment was created from the 
20 overlapping synthetic single-stranded sense (i.e., 

comprising the sequence TAT GGA GGA TCC AAT AAA GAA TCG 
TTT GTG TTA TGT TTC AAC GTG TTT ATT TTT C [SEQ ID NO: 12]) 
and antisense (i.e., comprising the sequence AAT TGA AAA 
ATA AAC ACG TTG AAA CAT AAC ACA AAC GAT TCT TTA TTG GAT 
25 CCT CCA [SEQ ID NO: 13]) oligonucleotides. The ends of 
the overlapping oligomers were made to have overhangs 
compatible for direct cloning into the Nde I and Mun I 
sites. The resultant vector pl93NS{AF) lacks all the 
coding sequence for the fiber gene but contains the 
30 entire adenovirus E4 coding sequence. The plasmid 

retains the AATAAA polyadenyiation signal included in the 
synthetic Nde 1/Mun I oligonucleotide and also 
incorporates the new Bam HI restriction site. 

Thus, following its construction in a series of 
35 sequential cloning steps, the transfer vector pl93(F5*) 
was employed in subsequent vector constructions. Namely, 
the sense oligonucleotide F5F2K(s)N (i.e., comprising the 
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sequence GGC CAT GGC CTA.GAA TTT GAT TCA AAC GGT GCC ATG 
ATT ACT AAA CTT GGA GCG [SEQ ID NO: 14] containing a Nco I 
restriction site) and the antisense oligonucleotide 
primer F5F2K(a)B (i.e., comprising the sequence GC GGA 
5 TCC TTA TTC CTG GGC AAT GTA GGA [SEQ ID NO: 15] containing 
a Bam HI restriction site) were used to amplify the knob 
coding region from purified Ad2 DNA by means of PCR. The 
incorporation of these sites on either end of the PCR 
product permitted it to be cut with Nco 1 and Bam HI and 

10 cloned into the base plasmid pl93(F5*) to create the 

transfer vector pl93.F5F2K depicted in Figure 4. Unlike 
pl93{F5*), pl93 F5F2K contains a unique Spe I restriction 
site within, the Ad2 fiber gene encoding an exposed loop 
in the protein. Namely, the fiber gene present in pl93 

15 F5F2K comprises the mutated fiber sequence 

ATT ACA CTT AAT GGC ACT AGT GAA TCC ACA 
lie Thr Leu Asn Gly Thr Ser Glu Ser Thr 

GAA ACT [SEQ ID NO: 16] 

20 Glu Thr [SEQ ID NO:17] 

wherein the underlined sequence indicates the novel Spe I 
site introduced into the fiber gene. 

This vector was then used to clone targeting 
sequences into the Spe I site. In particular, a nucleic 
25 acid sequence encoding the FLAG peptide motif DYKDDDDK 

(i.e.. Asp Tyr Lys Asp Asp Asp Asp Lys. [SEQ ID N0:2]) and 
a nucleic acid sequence encoding the stretch of 8 basic 
amino acids RKKKRKKK (Arg Lys Lys Lys Arg Lys Lys Lys 
[SEQ ID N0:1]) comprising the heparin binding domain were 
30 cloned into the Spe I site of pl93 F5F2K using 

overlapping sense and antisense oligonucleotides. 

Namely, the PolyGS (RKKK) 2 sequence comprises: 
ACT AGA AAA AAA AAA CGC AAG AAG AAG 
Thr Arg Lys Lys Lys Arg Lys Lys Lys 


The 27-mer sense oligonucleotide PolyGS (RKKK) 2 (s) 
(i.e., comprising the sequence CT AGA AAG AAG AAA CGC AAA 
40 AAG AAG A [SEQ ID NO:20]) and 27-mer antisense 


35 


ACT AGT 
Thr Ser 


(SEQ ID NO:18] 
[SEQ ID N0:19] , 
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oligonucleotide PolyGS (RKKK) 2 (a ) (i.e., comprising the 
sequence CT AGT CTT CTT TTT GCG TTT CTT CTT T (SEQ ID 
NO: 21]) were employed for cloning the PolyGS (.RKKK) 2 
sequence comprising the RKKKRKKK [SEQ ID NO: 17] peptide 
i motif. This plasmid was constructed by cloning the DNA 
sequence encoding the binding domain into the Spe I site 
of pl93 F5FK2. The overlapping sense and antisense 
oligonucleotides encoding the binding domain were first 
annealed and then directly ligated into the Spe I 
restriction site to result in the plasmid pl93 
F5F2K(RKKK2) depicted in Figure 5. 

Similarly, the FLAG sequence comprises: 
ACT AGA GAC TAC AAG GAC GAC GAT GAT AAG 
Thr Arg Asp Tyr Lys Asp Asp Asp Asp Lys 

Jhr m fSEQ ID NO:22] 

^ [SEQ ID NO:23] . 

The 30-mer sense oligonucleotide FLAG(s) (i.e., 
comprising the sequence CT AGA GAC TAC AAG GAC GAC GAT 
GAT AAG A (SEQ ID NO: 24]) and 30-mer antisense 
oligonucleotide FLAG (a) (i.e., comprising the sequence CT 
AGT CTT ATC ATC GTC GTC CTT GTA GTC T [SEQ ID NO: 25]) 
were employed for cloning the FLAG peptide sequence in a 
similar fashion as for pl93 F5F2K(RKKK2) to result in the 
plasmid pl93 F5F2K(FLAG) depicted in Figure 6. 

The FLAG sequence is recognized by the anti-FLAG M2 
antibody (Kodak, New Haven, CT) and is used for targeting 
adenovirus by means of bispecific antibodies (Wickham et 
al., "Targeted Adenovirus Gene Transfer to Endothelial 
and Smooth Muscle Cells Using Bispecific Antibodies", j. 
Virol^, 70(10), 6831-6838 (1996)). The RKKKRKKK [SEQ ID 
NO: 17] peptide sequence recognizes cellular heparin 
sulfate and is used to target the adenovirus to heparin 
sulfate-containing receptors on cells. Because heparin 
sulfate moieties are expressed on nearly all mammalian 
cells, the heparin-binding motif permits AdF2K(RKKK2) to 
bind to and transduce a broad spectrum of cells, as 
compared to unmodified (i.e., wild-type) adenovirus 
vectors . 
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The plasmids pl93^ F5F2K(RKKK2) and pl93 F5F2K(FLAG) 
were confirmed to contain the correct inserts through use 
of PGR analysis and mobility shift assays done on DNA 
fragments generated by restriction digests of the 
5 plasmids. Namely, the relevant portion of the modified 
loop of the fiber knob present in pl93 F5F2K(RKKK2) is: 
ATT ACA CTT AAT GGC ACT AGA AAG AAG AAA CGC AAA AAG AAG 
He Thr Leu Asn Gly Thr Arg Lys Lys Lys Arg Lys Lys Lys 

10 ACT AGT GAA TCC ACA GAA ACT ( SEQ ID NO: 26] 

Thr Ser Glu Ser Thr Glu Thr [SEQ ID NO:27] . 

The relevant portion of the modified loop of the fiber 
knob present in pl93 F5F2K(FLAG) is: 

ATT ACA CTT AAT GGC ACT AGA GAC TAC AAG GAC GAC GAT GAT 
15 He Thr Leu Asn Gly Thr Afg Asp Tyr Lys Asp Asp Asp Asp 

AAG ACT AGT GAA TCC ACA GAA ACT [SEQ ID NO: 28] 

Lys Thr Ser Glu Ser Thr Glu Thr [SEQ ID NO:29] . 

20 In another embodiment of the invention, an 

additional mutant fiber protein was created by site- 
directed mutagenesis. Primers were synthesized to 
replace the DNA sequence encoding amino acids of the CD, 
FG, and IJ loops of the Ad5 fiber knob. More 

25 particularly, the sequence encoding the amino acid 

sequence SGTVQ [SEQ ID NO: 81] (amino acids 44 9 to 453 of 
. the CD loop of the native Ad5 fiber knob) was replaced 
with DNA encoding the amino acid sequence Gly Ser Gly Ser 
Gly [SEQ ID NO:821 by carrying out site-directed 

30 mutagenesis of the plasmid pAcSG2 F5KN (Roelvink et al., 
using the primers 

GGC AGT TTG GCT CCA ATA GGA TCC GGG TCT GGA AGT GCT CAT 
CTT ATT [SEQ ID NO: 83] 
35 and 

AAT AAG ATG AGO ACT TCC AGA CCC GGA TCC TAT TGG AGC CAA 
ACT GCC [SEQ ID NO:85] 

Similarly, using appropriate primers for site-directed 
mutagenesis, the amino acid sequences Ser His Gly Lys Thr 
40 Ala [SEQ ID NO:86] (amino acids 507-512) of the FG loop 


15 
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and Ser Gly His Asn [SEQ ID NO:87] (amino acids 559-562) 
of the IJ loop, were replaced with Gly Ser Gly Ser Gly 
Ser fSEQ ID NO:88] and Gly Ser Gly Ser (SEQ ID NO:89] 
respectively. The resultant baculovirus transfer vectors 
5 containing the mutated fiber knob genes were used to make 
recombinant baculovirus. The recombinant baculoviruses 
were used to make recombinant fiber knob proteins 
containing the mutations. The resultant proteins were 
found to be fully soluble, indicating that they had 
10 correctly folded into trimers. Additionally, the soluble 
proteins were used to block adenovirus binding to cells 
which indicates that the substitutions in the fiber gene 
did not disrupt binding^o the fiber receptor. 

These results thus confirm that the methods 
described herein can be employed to construct transfer 
vectors encoding fiber sequences having insertions of 
various peptide motifs in an exposed loop of the knob 
region of the adenovirus fiber protein. 

Example 2 

This example describes the construction of 
adenoviral vectors encoding fiber sequences having 
insertions of various peptide motifs in a loop of the 
knob region of the adenovirus fiber protein. 
25 The transfer plasmids pl93 F5F2K(RKKK2) and pl93 

F5F2K(FLAG) were employed to obtain the corresponding 
adenoviral vectors comprising the FLAG and RKKK2 peptide 
motifs. This was accomplished by digesting these 
plasmids (which contain the essential E4 region of 
adenovirus) with Sal I, and transfecting them into 293 
cells that already had been infected 1 hour earlier with 
the adenovirus vector AdZ.E4Gus. This adenovirus vector 
lacks the E4 region and cannot replicate in 293 cells 
without the E4 genes. Only when AdZ.E4Gus DNA recombines 
35 with plasmid DNA such as pl93 F5F2K, pl93 F5F2K(FLAG), 
and pl93 F5F2K{RKKK2) to obtain the E4 genes is the 
vector able to replicate in 293 cells. During this 


30 
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recombination to rescue the adenoviral vector, the newly 
formed vector also picks up the mutated fiber sequence 
encoded by the plasmids. 

Viable recombinant E4* adenovirus containing the 
5 F2K(RKKK2) and F2K(FLAG) DNA sequences (i.e., AdZ . FLAG 
and AdZ.BKKK2) were isolated by plaguing the transfected 
cell lysates 5 days after transf ection . The recombinant 
adenoviruses were then plaque-purified 2 times on 293 
cells. The purified plaques were amplified on 293 cells. 

10 All viruses were purified from infected cells at 2 days 
post-infection by 3 freeze-thaw cycles followed by two 
successive bandings on CsCl gradients. Purified virus 
was dialyzed into 10 mM Tris, 150 mM NaCl, pH 7.8, 
containing 10 mM MgCl2, 3% sucrose, and was frozen at -80° 

15 until required for use. The purified viruses were 

verified by PGR to contain either the RKKK2 insert or the 
FLAG insert. 

These adenoviral vectors and the sequences they 
specifically target due to their possession of modified 
20 fiber knobs are depicted in Table 1. 


Taible 1. Adenoviral Vectors Coaprising Constreti-ned 
Pepti.de Motifs 


Vector Name 

Target Receptor 

Target Sequence 

AdZ . FLAG 

Any receptor (with 
use of a bispecific 
antibody) 

TRDYKDDDDKTS 
Thr Arg Asp Tyr 
Lys Asp Asp Asp 
Asp Lys Thr Ser 
[SEQ ID N0:23] 

Adz . RKKK2 

Heparin sulfate- 
containing receptors 

TRKKKRKKKTS 
Thr Arg Lys Lys 
Lys Arg Lys Lys 
Lys Thr Ser [SEQ 
ID NO:19] 


These results thus confirm that the methods 
described herein can be employed to construct adenoviral 
vectors encoding fiber sequences having insertions of 
various peptide motifs in an exposed loop of the knob 
30 region of the adenovirus fiber protein. 
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Example 3 

This example describes the characterization of 
adenoviral vectors encoding fiber sequences having 
5 insertions of various peptide motifs in a loop of the 
knob region of the adenovirus fiber protein. 

The FLAG insert present in the AdZ.FLAG vector was 
shown to be functionally accessible and capable of 
binding the anti-FLAG M2 mAB as assessed by 
10 immunofluorescence, as previously described (Wickham et 
al., 1993). Briefly, 293 cells were infected at a low 
multiplicity of infection (i.e., about a 0.02 MOI) with 
the AdZ.RKKK2 or AdZ.FLAG isolates. The cells were fixed 
at two days post -infection and incubated with either a 
5 rabbit anti-penton base polyclonal antibody or a mouse 
anti-FLAG mAB, followed by incubation with anti-rabbit or 
anti-mouse FITC antibody. The anti-penton base antibody 
recognized cells infected by either virus. In 
comparison, the FLAG mAB recognized only the cells 
0 infected with the AdZ.FLAG virus, and not the cells 
infected with the AdZ.RKKK2 virus. 

These results confirm that adenoviruses produced 
according to the method of the invention are viable, and 
that the insert (e.g., FLAG epitope) present in an 
5 exposed loop of fiber protein is accessible to and 
capable of binding its corresponding binding entity 
(e.g., a cell surface binding site or an antibody such as 
the anti-FLAG antibody) . These results confirm that the 
method of the invention can be employed for adenoviral- 
0 mediated cell targeting. 

Example 4 

This example describes gene delivery mediated by 
adenoviral vectors encoding fiber sequences having 
5 insertions of various peptide motifs in an exposed loop 
of the knob region of the adenovirus fiber protein. 
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For testing the ability of the RKKK2 motif to effect 
cell targeting, 293 cells (which appear to express 
relatively high levels of the receptor by which wild-type 
adenovirus fiber protein effects cell entry) were 
5 preincubated for 30 minutes in the presence and absence 
of competing wild-type fiber protein. Purified AdZ or 
AdZ.RKKK2 vectors were then incubated with the cells for 
an additional 60 minutes at 37''C. The cells were washed 3 
times with PBS, and incubated in culture medium 

10 overnight. (J-galactosidase activity from lysed cells was 
then determined using a p-galactosidase. fluorometric 
assay kit (Tropix, Bedford, MA) . Activity was measured 
in a luminometer in relative light units (RLU) . 

The data illustrated in Figure 7 demonstrates gene 

15 delivery to 293 cells effected by the AdZ.RKKK2 vector. 
As can be seen from this figure, recombinant wild-type 
fiber protein blocked gene delivery by AdZ, but not by 
AdZ.RKKK2. The AdZ . RKKK2 vector was able to overcome the 
fiber-mediated block to adenoviral-mediated gene 

20 delivery. 

These results confirm that this constrained peptide 
motif present in the fiber loop is able to efficiently 
mediate cell binding/entry. Moreover, the results 
further confirm that adenoviral vectors encoding fiber 
25 sequences having insertions of various peptide motifs in 
an exposed loop of the knob of the adenovirus fiber 
protein can be employed for delivery (e.g., of DNA and/or 
protein) to cells. 

30 Example 5 

This example describes other oligonucleotides that 
can be employed for inserting a nonnative amino acid 
sequence into a chimeric adenovirus fiber protein, 
preferably in an exposed loop of the adenovirus fiber 
35 knob, but also at the C-terminus of the protein. 

The cloning techniques described in the previous 
example can be employed to incorporate into an exposed 


loop of the fiber knob inserts comprising peptide motifs 
that will target, for instance, Ov integrins, asPi 
integrin, FLAG mAb, or other cell surface binding sites. 
In particular, an HAav sequence can be inserted. 
3 This sequence comprises: 

ACT AGA GCC TGC GAC TGT CGC GGC GAT TGT TTT TGC GGT 
Thr Arg Ala Cys Asp Cys Arg Gly Asp Cys Phe Cys Gly 

) Thl ter ID NO: 30] 

(SEQ ID N0:31] . 

The sequence can be inserted with use of the 39-mer sense 
oligonucleotide HAav(s) (i.e., comprising the sequence CT 
AGA GCC TGC GAC TGT CGC GGC GAT TGT TTT TGC GGT A [SEQ ID 
NO: 32]) and the 30-mer antisense oligonucleotide HAav{a) 
(i.e., comprising the sequence CT AGT ACC GCA AAA ACA ATC 
GCC GCG ACA GTC GCA GGC T . [SEQ ID NO:33]). These 
oligonucleotides were used to make pl93 (F5* ) pGS (RGD) , 
which was used to make AdZ.RGD. 

Similarly, an HAospi sequence can be inserted that 
allows targeting for integrin aspi . This representative 
sequence comprises: 

ACT AGA TGC CGC CGC GAA ACC GCT TGG GCC TGT 
Thr Arg Cys Arg Arg Giu Thr Ala Trp Ala Cys 

ThI fS^Q ID NO: 34] 

[SEQ ID NO:35] ) . 

The sequence can be inserted with use of the 39-mer sense 
oligonucleotide HAasPi{s) (i.e., comprising the sequence 
CT AGA TGC CGC CGC GAA ACC GCT TGG GCC TGT A [SEQ ID 
NO:36]) and the 39-mer antisense oligonucleotide HAUsPiia) 
(i.e., comprising the sequence CT AGT ACA GGC CCA AGC GGT 
TTC GCG GCG GCA T [SEQ ID NO:37]). 

These sequences (and other sequences described 
herein) that allow targeting to the integrins are of 
use since this target receptor demonstrates broad 
distribution, including to endothelial cells and smooth 
muscle cells. The adhesion receptor appears to be 
important in wounds (i.e., both healing and exacerbation 
thereof) , as well as in angiogenesis, restenosis and 
metastasis. Generally, the receptor is upregulated in 
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proliferating endothelial cells and smooth muscle cells, 
and exhibits high expression in melanoma and glioma. 
Normal ligands for the ttv integrins receptor include 
vitronectin, collagen, fibronectin, laminin, and 
5 osteopontin. 

Also, an E-selectin targeting sequence can be 
inserted. A representative sequence comprises: 
ACT AGA GAC ATT ACC TGG GAC CAG CTT TGG GAC CTT ATG AAG 
Thr Arg Asp He Thr Trp Asp Gin , Leu Trp Asp Leu Met Lys 

10 

ACT AGT [SEQ ID NO: 38] 

Thr Ser [SEQ ID NO:39] . 

Further ligands that bind elastin have been described in 
the art and similarly can be employed as nonnative amino 

15 acid sequences for the generation of peptide motifs as 
described herein (see, e.g.. Martens et al., J. Biolog. 
Chem. , 270, 21129-21136 (1995)). The E-selectin sequence 
can be inserted with use of the 42-mer sense 
oligonucleotide E-selectin ( s ) (i.e., comprising the 

20 sequence CT AGA GAC , ATT ACC TGG GAC CAG CTT TGG GAC CTT 
ATG AAG A [SEQ ID NO:40]) and the 42-mer antisense 
oligonucleotide E-selectin (a) (i . e ., comprising the 
sequence CT AGT CTT CAT AAG GTC CCA AAG CTG GTC CCA GGT 
AAT GTC T [SEQ ID NG:41]). 

25 Furthermore, a PolyGS (RKKK) 3 sequence, or other 

variations of this sequence, can be inserted. This 
sequence comprises: 

ACT AGA AAG AAG AAG CGC AAA AAA AAA AGA AAG AAG AAG 
Thr Arg Lys Lys Lys Arg Lys Lys Lys Arg Lys Lys Lys 

30 . 

ACT AGT [SEQ ID NO: 42] 

Thr Ser [SEQ ID NO: 43] . 

The sequence can be inserted with use of the 39-mer sense 
oligonucleotide PolyGS (RKKK) 3 (s) (i.e., comparing the 
35 sequence CT AGA AAG AAG AAG CGC AAA AAA AAA AGA AAG AAG 
AAG A [SEQ ID NO:441) and the 39-mer antisense 
oligonucleotide PolyGS (RKKK) 3 (a) (i.e., comprising the 
sequence CT AGT CTT CTT CTT TCT TTT TTT TTT GCG CTT CTT 
CTT T [SEQ ID NO: 45] ) . 
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This example thus confirms that other 
oligonucleotides can be employed for inserting a 
nonnative amino acid sequence into a fiber protein. Such 
insertions can either be made in an exposed loop of the 
5 adenovirus fiber knob or, as described as follows, at the 
C-terminus of the fiber protein. Moreover, the nonnative 
amino acid sequence can be incorporated into the chimeric 
fiber protein not merely as an insertion into the 
sequence, but also as a replacement of adenoviral 
10 sequences. This can be done through modification of the 
cloning procedures described herein, as are known to 
those skilled in the art. 


Example 6 

15 In a similar fashion to the constraint achieved by 

placing a peptide motif within an exposed loop of the 
adenovirus fiber protein, constraint can be obtained 
through appropriate modification of a peptide motif at 
the C-terminus of the fiber protein to create, in 

20 essence, a nonpreexisting loop at this site. Thus, this 
example describes the construction of transfer vectors 
encoding fiber sequences having insertions of various 
constrained peptide motifs at the C-terir.inus of the 
adenovirus fiber protein. This method is depicted in 

25 Figure 2. 

The transfer vector pl93(F5*) described in Example 1 
was used as a base plasmid to create chimeric adenovirus 
particles containing C-terminal additions to the fiber 
gene. In particular, DNA sequences encoding a linker 

30 sequence followed by a targeting sequence and a stop 

codon were cloned into the Bam HI site to create further 
transfer vectors which, in turn (i.e., via the 
construction of the further transfer vectors 
pl93(F5)pGS(RGD) and pI93(F5)pGS) were used to make 

35 chimeric adenovirus particles. 

The mutant transfer plasmids containing sequences 
encoding an amino acid glycine/serine repeat linker, a 
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targeting sequence, and a stop codon were raade by cloning 
synthetic oligonucleotides into the Sam HI site of 
pl93(F5*). The cloning reactions essentially were 
carried out as described in Example 1. In particular> 
5 the overlapping synthetic oligonucleotides used to make 
the transfer plasmid pl93 (F5) pGS (RGD) depicted in Figure 
8 were: sense, GA TCA GGA TCA GGT TCA GGG AGT GGC TCT 
GCC TGC GAC TGT CGC GGC GAT TGT TTT TGC GGT TAA G [SEQ ID 
NO: 46]; antisense, GA TGC TTA ACC GCA AAA ACA ATC GCC GCG 

10 ACA GTC GCA GGC AGA GCC ACT CCC TGA ACC TGA TCC T [SEQ ID 
NO:47]. This plasmid comprises the nucleic sequence GCC 
CAA GAA GGA TCA GGA TCA GGT TCA GGG AGT GGC TCT GCC TGC 
GAC TGT CGC GGC GAT TGT TTT TGC GGT TAA GGA TCC AAT AA 
[SEQ ID NO:48] that encodes the amino acid sequence Ala 

15 Gin Glu Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Ala Cys 
Asp Cys Arg Gly Asp Cys Phe Cys Gly *** [SEQ ID NO: 49], 
wherein *** refers to the stop codon. The RGD peptide is 
present within this larger sequence. The plasmid 
pl93 (F5) pGS (RGD) thus -comprises the targeting sequence 

20 CDCRGDCFC ■ (i.e., Cys Asp Cys Arg Gly Asp Cys Phe Cys [SEQ 
ID N0:3]) which is present in the larger sequence Ser Ala 
Cys Asp Cys Arg Gly AspCys Phe Cys Gly [SEQ ID NO:79]. 
This sequence, like other sequences described earlier 
containing the tripeptide motif RGD, acts as a ligand for 

25 the target receptor as integrins . However, highly 

constrained forms of RGD bind with higher affinities to 
integrins than linear forms (see, e.g., Aumailley et al., 
FEBS , 291 , 50-54 (1991); Cardarelli et al . , J. Biolog. 
Chem. , 269., 18668-18673 (1994); Koivunen et al., 

30 Bio/Technology , 13 , 265-270 (1995)). Along these lines, 
the constrained RGD targeting motif present in 
pl93 (F5)pGS(RGD) binds with about 100-fold higher 
affinity to ttv integrins than does similar linear RGD 
motifs. Each pair of cysteines on either side of the RGD 

35 form disulfide binds with the opposite pair of cysteines 
to form a highly constrained RGD loop. 


wo 98m86S 

PCT/US97/14719 

54 

Moreover, variations of the CRCRGDCFC fSEQ tq no-3I 
targeting sequence can be employed in the context of tt.e 
present invention.. For instance, instead of two cysteine 
res.dues on either side of the RCO tripeptide sequence 
5 oniy one resxdue can be used instead. Any sequence ca 
be employed, so long as a loop-li.e structure is created 
encompassing the RGD sequence, and so long as the 

tTLTs'"'''''' ^^^^^'"^ ^ 

0 e g. LOV substituted by another sequence. 

In tern,s of construction of the related transfer 
P asmxd pl93(F5,pGS, the overlapping synthetic 
Oligonucleotides used to .a.e the transfer plas.id were: 
sense, PolyGS,s,. GA TCC GGT TCA GGA TCT GGC AGT GGC TCG 
> ACT AGT TAA A fSEQ 10 NO: 50,; antisense, PolyGS (a) GA 

T T.A ACT AGT CGA GCC ACT GCC AGA TCC TGA IcC G U^io 
NO. 51], The sense and antisense oligonucleotides were 

of Pl93{r5*, to create pl93(F5)pGS. The transfer vector 
P (r5)pGS then was used to construct further transLr 
vectors, as described in the following Examples. 

Thus, thxs example confirms that transfer vectors 
encoding fiber sequences having insertions of various 
constrained peptide motifs at the C-terminus of the 
adenovirus fiber protein can be constructed according to 
the invention. Other transfer vectors (i.e., having 
different targeting sequences) also can be constructed 
using this approach. 

Example 7 

This example describes the construction of 
adenovirus vectors encoding fiber sequences having 
insertions of various constrained peptide motifs at the 
C- terminus of the adenovirus fiber protein. 

The El- and E3-deleted adenovirus AdZ employed for 
these experiments contains the p-galactosidase gene under 
the control of a cytomegalovirus (CMV) promoter and 
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integrated into the adenoviral genome. AdZ was 
propagated in human embryonic kidney 293 ceils, which 
contain the complementary El region for virus growth. 
AdZ.RGD (as well as other vectors targeted to other 
5 adhesion receptors described herein) was derived directly 
from AdZ. These viruses likewise are El- and E3-deleted, 
and are identical to AdZ, except for the presence of 
additional amino acids on the C-terminus of the fiber 
proteins. 

10 The transfer plasmids, pl93(F5)pGS and 

pl93 ( F5) pGS (RGD) , which contain the essential E4 region 
of adenovirus, were employed for adenoviral vector 
construction. These transfer plasmids were cut with Sal 
I and transfected into 293 cells that had been infected 

15 one hour prior with the adenovirus vector, AdZ.E4Gus. 

The adenovirus vector AdZ.E4Gus lacks the E4 region and 
cannot replicate in 293 cells without the E4 genes. Only 
when AdZ.E4Gus DNA recombines with the pl93(F5)pGS or 
pl93 (F5) pGS (RGD) plasmid DNA to obtain the E4 genes is 

20 the vector. able to replicate in 293 cells. During this 
recombination, the newly formed vector also picks up the 
fiber mutations encoded in the plasmids. Viable 
recombinant E4* adenovirus containing the pGS and pGS (RGD) 
mutations were then isolated by plaquing the transfected 

25 ceil lysates 5 days after transf ection . Their resultant 
vectors, AdZ.pGS and AdZ.RGD, were isolated and purified 
by two successive rounds of plaquing on 293 cells. Each 
vector was verified to contain the correct insert by 
sequencing PGR products from virus DNA that spans the 

30 region of the insert DNA. 

This example confirms that adenovirus vectors 
encoding fiber sequences having insertions of various 
constrained peptide motifs at the C-terminus of the 
adenovirus fiber protein can be constructed according to 

35 the invention. 
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Example 8 

This example describes the construction of transfer 
vectors and adenoviral vectors with use of other 
oligonucleotides that can be employed for inserting a 
5 nonnative amino acid sequence into a chimeric adenovirus 
fiber protein, preferably in an exposed loop of the 
adenovirus fiber knob, but also at the C-terminus of the 
protein. 

The cloning techniques described in Example 6 were 
10 employed to create additions at the C-terminus. 

Basically the transfer vectors described in this Example 
(in particular, the transfer vector pl93(F5)pGS) were 
linearized at the unique cloning site Spe I present in 
the vectors, and new sequences were inserted at this 
15 site. Other means (e.g., PGR reactions) also can be 
employed to make insertions into this unique site. 
Similarly, the cloning techniques described in Example 5 
can be employed to incorporate into an exposed loop of 
the fiber knob inserts comprising peptide motifs that 
20 target other cell surface binding sites or epitopes for 
an antibody. 

In particular, multiple copies of the RGD sequence 
(i.e., a polyRGD or pRGD sequence) were inserted. This 
sequence comprises: 

25 ACT AGT GGA AGA GGA GAT ACT TTT GGC CGC GGC GAC ACG TTC 
Thr Ser Gly Arg Gly Asp Thr Phe Gly Arg Gly Asp Thr Phe 

GGA AGG GGG GAT ACA TTT TCT AGT [SEQ ID NO: 52 1 

Gly Arg Gly Asp Thr Phe Ser Ser [SEQ ID NO:53]. 

30 The sequence was inserted with use of the sense 

oligonucleotide pRGDs (i.e., comprising the sequence CT 
AGT GGA AGA GGA GAT ACT TTT GGC CGC GGC GAC ACG TTC GGA 
AGG GGG GAT ACA TTT T [SEQ ID NO:54]) and the antisense 
oligonucleotide pRGDa (i.e., comprising the sequence CT 

35 AGA AAA TGT ATC CCC CCT TCC GAA CGT GTC GCC GCG GCC AAA 
AGT ATC TCC TCT TCC A [SEQ ID NO:55]). 

The resultant plasmid pl93(F5*)RGD was employed to 
create the adenovirus AdZ.pRGD. A comparison of the 
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inserts present in AdZ.RGD and AdZ.pRGD (with the RGD 
peptide indicated emboldened) is presented in Table 2. 

Table 2. Comparison of Adenoviral Vectors 
AdZ.RGD and AdZ.pRGD 

Vector Name Target Receptor Target Sequence 


av Integrins SACDCRGDCFCGTS 
[SEQ ID NO:68] 

av Integrins TS (GRGDTF) 3SS 

pi Integrins [SEQ ID NO:53] 


Similarly, one or more copies of an LDV targeting 

10 sequence can be inserted. The LDV target receptor is 
distributed in hematopoietic cells, lymphocytes, and 
monocytes/macrophages. The adhesion receptor is highly 
expressed on resting lymphocytes involved in cell-matrix 
and cell-cell interactions (e.g., during hematopoietic 

15 extravasation, as well as inflammation, and lymphocyte 
trafficking) . Ligands for the a* integrins target 
receptor include, but are. not limited to, fibronectin (an 
extracellular matrix protein), VCAM-1. (which targets 
endothelial tissue), and MAdCAM (a4P7) (which is gut- 

20 specific) . In particular, the 04 integrins targeting 
sequences includes the sequence EILDVPST (i.e., Glu He 
Leu Asp Val Pro Ser Thr [SEQ ID NO:56] encompassed by the 
sequence above, and the sequence (EILDVPS)3 (or, three 
copies of the peptide motif EILDVPS [SEQ ID NO: 80] in 

25 tandem, or Glu He Leu Asp Val Pro Ser Glu He Leu Asp 
Val Pro Ser Glu lie Leu Asp Val Pro Ser) [SEQ ID NO:57]. 

In particular, multiple copies of the LDV sequence 
(i.e., a polyLDV or pLDV sequence) can be inserted to 
comprise the sequence: 

30 ACT AGT GAA ATT CTT GAC GTC GGA GAG ATC CTC GAG GTC GGG 
Thr Ser Glu He Leu Asp Val Gly Glu He Leu Asp Val Gly 

GAA ATA CTG GAC GTC TCT AGT [SEQ ID NO: 58] 

Glu He Leu Asp Val' Ser Ser CSEQ ID NO:59] . 


wo 98/07865 PCT/US97/14719 

58 

This sequence was inserted with use of the sense 
oligonucleotide pLDVs (i.e., comprising the sequence CT 
AGT GAA ATT CTT GAC GTC GGA GAG ATC CTC GAC GTC GGG GAA 
ATA CTG GAC GTC T [SEQ ID^NO:60]) and the anti sense 
5 oligonucleotide pLDVa (i.e., comprising the sequence CT 
AGA GAC GTC CAG TAT TTC CCC GAC GTC GAG GAT CTC TCC GAC 
GTC AAG AAT TTC A (SEQ ID N0:61]). . 

Such insertion resulted in the generation of the 
vector pl93(F5)pLDV depicted in Figure 9. The LDV 

10 targeting motif present in this vector (i.e., comprising 
the sequence of SEQ ID NO:59) binds with sub-millimoiar 
affinity to integrins. The LDV motif is repeated 3 
times in- each fiber monomer for a total of 9 motifs per 
fiber molecule. This vector further was employed for the 

15 generation of a corresponding adenoviral vector. 

Furthermore, a pYIGSR targeting sequence was 
inserted at the C-terminus of the fiber protein to derive 
the plasmid pl93 (F5) pYIGSR depicted in Figure 10. The 
fiber protein in this plasmid comprises the amino acid 

20 sequence: 

ACT AGT GGA TAC ATC GGC AGT CGC GGT TAC ATT GGG TCC 
Thr Ser Gly Tyr He Gly Ser Arg Gly Tyr He Gly Ser 

25 CGA GGA TAT ATA GGC TCA AGA TCT AGT [SEQ ID NO: 62] 

Arg Gly Tyr He Gly Ser Arg Ser Ser (SEQ ID NO:63). 

The sequence was inserted with use of the sense 
oligonucleotide pYIGSRs (i.e., comprising the sequence CT 
AGT GGA TAC ATC GGC AGT CGC GGT TAC ATT GGG TCC CGA GGA 

30 TAT ATA GGC TCA AGA T (SEQ ID NO:64]) and the antisense 
oligonucleotide pYIGSRa (i.e., comprising the sequence CT 
AGA TCT TGA GCC TAT ATA TCC TCG GGA CCC AAT GTA ACC GCG 
ACT GCC GAT GTA TCC A (SEQ ID NO: 65] ) . 

The resultant plasmid contains the YIGSR [SEQ ID 

35 NO: 66] (i.e., comprising the sequence Tyr He Gly Ser Arg 
[SEQ ID NO: 66) targeting motif, which binds with sub- 
millimolar affinity to the high affinity laminin 
receptor. The YIGSR [SEQ ID NO: 66] motif, present as 
YIGSRG (i.e., comprising the sequence Tyr He Gly Ser Arg 
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Giy [SEQ ID NO:67]), is repeated 3 times in each fiber 
monomer for a total of 9 motifs per fiber molecule. In 
particular, the YIGSR [SEQ ID NO: 66) motif provides for 
targeting to the 67 kilodalton laminin/elastin receptor. 
5 This receptor is present in monocytes/neutrophils, 

vascular smooth muscle, fibroblasts, and chondrocytes, 
and is upregulated in multiple tumors. Furthermore, the 
receptor appears to be involved in tumor metastasis and 
angiogenesis . Typical ligands for the laminin/elastin 

10 receptor include laminin, elastin, and galactose. The 
pl93 (F5) pYIGSR piasmid derived herein further was 
employed to create the adenovirus vector AdZ. pYIGSR. 

This example thus confirms that other 
oligonucleotides can be employed for inserting a 

15 nonnative amino acid sequence into a fiber protein. Such 
insertions can either be made in an exposed loop of the 
adenovirus fiber knob, or, as described as follows, at 
the C-terminus of the fiber protein. Moreover, the 
nonnative amino acid sequence can be incorporated into 

20 the chimeric fiber protein not merely as an insertion 
into the sequence, but also, as a replacement of 
adenoviral sequences. This can be done through simple 
modification of the cloning procedures described herein, 
such as are known to those skilled in the art. 

25 

Example 9 

This example describes the characterization of 
adenoviral vectors encoding fiber sequences having an 
insertion of a constrained RGD peptide motif at the C- 

30 terminus of the adenovirus fiber protein. In particular, 
the ability of these vectors to produce active virus 
particles in different ceils was investigated. 

For the Western analysis of virus particles, 
purified virus particles (2 x 10^°) in a volume of 10 nl 

35 were diluted 1:1 in Laemmli running buffer and loaded 
onto a 9% acrylamide, 0.1% SOS gel. The gel was run at 
150 mV and was then transferred to nitrocellulose. The 
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nitrocellulose was- blocked with 5% dry milk and probed 
with a combination of rabbit polyclonal antibodies 
directed against denatured Ad5 virions (1:1000) and 
against fiber protein (1:5000). The proteins were 
5 detected using antirabbit-peroxidase (1:5000) and a 
commercially available chemiluminescent detection kit. 

The fiber proteins of the recombinant adenoviruses 
AdZ.pGS and ADZ.RGD were shifted upward on the Western 
relative to the fiber protein contained by the AdZ 
10 vector. A gel run in parallel that was transferred to 
nitrocellulose and probed using only the polyclonal 
antibody directed against the fiber protein demonstrated 
that the shifted bands in the Western analysis were, in 
fact, fiber protein. These results confirm that the 
15 AdZ.pGS and Adz. ROD fiber proteins contain the 
appropriate amino acid inserts. 

The viral production kinetics were determined to 
confirm that viable adenovirus was being produced in 293 
cells infected with various adenoviral vectors according 
20 to the invention. To carry out these studies, 

radiolabeled adenovirus was made by adding 50 (iCi/ml 
[ H] thymidine- (Amersham, Arlington Heights, IL) to the 
medium of infected cells at 20 hours following their 
infection at an MOI of 5 . The infected cells were then 
25 harvested at 60 hours post-infection, and the virus was 
purified as previously described- The activity of the 
labeled viruses was approximately lo" virus particles/cpm. 
Infectious particles were titered in fluorescence focus 
units (ffu) using a fluorescent focus assay on 293 cells. 
30 Active virus particle production kinetics from 

infected 293 cells were determined by infecting 10^ 293 
cells with 0.2 ml of either AdZ or AdZ.RGD for 1 hour in 
6 cm plates at an MOI of 10 on day 0. The cells were 
harvested on 1, 2, and 3 days post-infection. The cells 
35 were spun down and resuspended in 1 ml of PBS for AdZ and 
AdZ.RGD. The cells were frozen and thawed 3 times to 
release the. virus particles. The lysates were then 
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assayed for the number of active particles produced per 
cell using standard techniques. The results of these 
experiments (depicted in Figure 11) confirm that the 
modifications to the fiber protein in AdZ.RGD do not 
5 significantly affect the production of active virus 
particles compared to the unmodified vector, AdZ. 

The particle dose-response of the vectors AdZ and 
AdZ.RGD on A549 epithelial, CPAE endothelial, and human 
intestinal smooth muscle (HISM) cells similarly was 

10 investigated. HISMC, CPAE, or A549 cells (5 x 10^ 

cells/well) were seeded onto 6 cm plates 1-2 days prior 
to experiments. In assays evaluating the vector dose- 
response in fiber receptor-expressing cells, increasing 
concentrations of AdZ or AdZ.RGD particles were incubated 

15 with the cells for 60 minutes at 37''C in 0.2 ml DMEM + 20 
mM HEPES. The plates were shaken every 10 minutes during 
this incubation. The cells were then washed 2 times with 
DMEM and cultured in DMEM + 5% calf serum for 2-3 days at 
37°C. The medium was than aspirated, and the cells were 

20 lysed in 1 ml IX reporter lysis buffer -t- 10 mM EDTA 

(Promega, Madison, WI). The p-galactosidase activity in 
the cell lysates was then assayed as previously 
described. Results are the average of duplicate 
measurements. 

25 The results of these experiments are presented in 

Figures 12-14. These experiments confirm that the AdZ 
and AdZ.RGD vectors are equivalent in terms of their 
ability to enter and produce viable virus particles in 
cells {A549) known to express high levels of adenovirus 

30 fiber receptor (i.e., A549 cells as presented in Figure 
12). However, for the CPAE and HISM cells (i.e., 
presented in Figure 13 and Figure 14, respectively) which 
lack significant levels of adenovirus fiber receptor, but 
do express integrins, the AdZ.RGD vector is much more 

35 efficient in transduction than is the unmodified, AdZ 
vector. Transduction of the CPAE and HISM cells by 
AdZ.RGD is roughly 100-fold and 30-fold higher. 


respectively, than AdZ over a wide range of vector 
concentrations . 

These results validate that amino acid inserts 
present in adenoviral vectors according to the invention 
I are appropriately translated within the context of the 
chimeric adenovirus fiber protein, and that the resultant 
chimeric fiber protein is functional, as assessed by the 
generation of viable adenoviruses containing this 
protein. Moreover, the results confirm that the peptide 
motif present in the chimeric fiber protein is able to 
redirect adenovirus binding, and to selectively effect 
adenoviral cell binding/entry with a high efficiency. 

Example 10 

This example describes the binding behavior of 
adenoviral vectors encoding fiber sequences having an 
insertion of a various constrained peptide motif at the 
C-terminus of the adenovirus fiber protein. 

The specificity of the AdZ and the AdZ.RGD vectors 
in binding to kidney (835), smooth muscle (AlO) , and 
endothelial (CPAE) cells was studied. For these 
experiments, monolayers of 835, AlO, or CPAE cells in 24 
well tissue culture plates were preincubated for 45 
minutes with 0.3 ml medium containing soluble recombinant 
fiber (F5; 3 ug/ml), penton base (PB; 50 ^g/ml), fiber 
plus penton base, or neither coat protein. Radiolabeled 
AdZ or AdZ.RGD was then added to the wells and incubated 
for 90 minutes while rocking at room temperature. The 
wells were washed 3 times with PBS, and the remaining 
cell-associated radioactivity was determined in a 
scintillation counter. The results of these experiments 
are presented graphically in Figurea 15-17, and 
quantitatively in Tabla 3. 
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Con^arison of AdZ and AdZ.RGD binding to three 
cell linos* 



835 

HEK** 

CPAE 


AlO* 



AdZ 

AdZ. 
RGD 

AdZ 

AdZ . 

RGD 

AdZ 

AdZ. 

. RGD 

Control 

7.6 

12.7 

0.19 

0.84 

0.72 

1.68 

Fiber 

1.7 

12.3 

0.22 

1.06 

0.23 

1.40 

PB 

9.0 

9.7 

0.20 

0.37 

0.80 

0.62 

Fiber/PB 

1.0 

3.7 

0.21 

0.4 6 

0.20 

0.41 


Values represent percentage of input vector in binding 


These results confirm that fiber protein significantly 
blocks AdZ transduction, but not AdZ.RGD transduction of 
both the 835 (Figure 15) and AlO (Figure 16) cells. Only 
fiber plus penton base, which, in combination, blocks 
both fiber receptor and ttv integrins, is able to 
significantly block binding of AdZ.RGD to these cells. 
For the CPAE cells which lack detectable levels of fiber 
receptor (Figure 17) penton base alone is able to 
significantly block binding of AdZ.RGD. 

These results demonstrate that AdZ.RGD interacts 
with ttv integrins on cells. Moreover, the results 
validate that the peptide motif as present in the fiber 
protein of AdZ.RGD can effectively be employed to target 
adenovirus to particular cells. 


Example 11 

This example describes gene delivery mediated by 
adenoviral vectors encoding insertions of various 
.30 sequences at the C-terrainus of the adenovirus fiber 
protein. 

For testing the ability of the YIGSR [SEQ ID NO: 66] 
peptide motif to effect cell targeting, A54 9 cells were 
preincubated for 30 minutes in the presence and absence 
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of competing wild-type fiber protein. Purified AdZ or 
AdZ.pYIGSR vectors were then incubated with the cells for 
an additional 60 minutes at 37''C. The cells were then 
washed 3 times with PBS and incubated in culture medium 
5 overnight, p-galactosidase activity from the lysed cells 
was determined. 

As presented in Figure 18, recombinant wild- type 
fiber protein completely blocked gene delivery by both 
vectors. Increased gene delivery by the AdZ.pYIGSR 
10 vector is not observed in the presence of fiber protein. 
This indicates that the pYIGSR targeting motif is not of 
sufficiently high affinity to overcome the block to 
adenovirus binding that is achieved with the addition of 
soluble fiber protein. 
15 For testing the ability of the pLDV motif to effect 

cell targeting, Ramos cells (which express high levels of 
the a* integrin target receptor) were preincubated for 30 
minutes in the presence and absence of competing wild- 
type fiber protein. The purified AdZ or AdZ.pLDV vectors 
20 were then incubated with the cells for an additional 60 
minutes at 31'*C. The cells were washed 3 times with PBS, 
and incubated in culture medium overnight. |3- 
galactosidase activity from the lysed cells was then 
determined. 

25 Figxiro 19 illustrates gene delivery to Ramos cells 

effected by the AdZ.pLDV vector. As can be seen from 
this figure, recombinant wild-type fiber protein blocked 
gene delivery by both AdZ and AdZ.pLDV. As with 
AdZ.pYIGSR, there is no evidence of increased gene 

30 delivery effected by the AdZ.pLDV vector in the presence 
of fiber protein. This indicates that the pLDV targeting 
motif, like the YIGSR [SEQ ID NO:66] targeting motif, is 
not of sufficiently high affinity to overcome the fiber- 
mediated block to protein binding. The remaining gene 

35 delivery capacity of AdZ.pLDV that is not blocked by the 
addition of soluble fiber protein also is not blocked by 
further incubation with EDTA. In comparison, the 
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interaction of the a4 integrins with the LDV motif 
normally present in fibronectin is blocked by EDTA. This 
result further confirms that the pLDV targeting motif is 
not interacting with high enough affinity with a4 
5 integrins to increase vector binding and gene delivery to 
Che Ramos cells. However, with both the YIGSR motif 
(i.e., comprising the sequence of [SEQ ID NO: 66] and the 
LDV motif, it is possible that high affinity peptide 
motifs could be derived by the conformational restraint 
10 of these peptides in an exposed loop of the fiber 
proteins . 

The ability of the RGD motif to effect cell 
targeting similarly was studied in av-integrins 
expressing 293 cells. These studies were carried out as 

15 for the other peptide motifs/cell lines. However, for 

comparative purposes, the vectors AdZ and AdZ.pRGD (i.e., 
the vector containing multiple copies of the RGD motif 
not having cysteine residues) were also included. The 
results of these studies are presented in Figure 20. As 

20 can be' seen from this figure, AdZ, RGD, but not AdZ.pRGD, 
clearly was able to overcome the fiber-mediated block to 
adenoviral-mediated gene delivery. 

These results thus confirm that the RGD peptide 
motif (i.e., present as a loop at the C-terminus of the 

25 fiber protein), like the RKKK2 motif present in a loop of 
the adenovirus fiber protein (described in Example 4), is 
of sufficiently high affinity that it was able to 
overcome the fiber-mediated block to adenoviral-mediated 
gene delivery, and effectively "swamp out" the typical 

30 interaction of wild-type fiber protein with its cellular 
receptor to target the adenovirus to a new receptor. 

The results further confirm that the constraint of a 
nonnative amino acid sequence (i.e., either through 
insertion in a fiber loop or creation of a loop like 

35 structure at the fiber terminus) can result in the 

creation of a high affinity peptide motif. Such a high 
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affinity peptide motif is of use in adenoviral cell 
targeting. 

All of the references cited herein, including 
5 patents, patent applications, and publications, are 

hereby incorporated in their entireties by reference to 
the same extent as if each reference were set forth in 
Its entirety herein. 

While this invention has been described with an 
emphasis upon preferred embodiments, it will be apparent 
to those of ordinary skill in the art that variations in 
the preferred embodiments can be prepared and used and 
that the invention can be practiced otherwise than as 

15 ;^;^'f"^^^^-«^^«^^-ein- The present invention is 
xntended to include such variations and alternative 
practices. Accordingly, this invention includes all 
modifications encompassed within the spirit and scope of 
the invention as defined by the following claims 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: GenVec, Inc. 

(ii) TITLE OF INVENTION: TARGETING ADENOVIRUS WITH USE OF 
CONSTRAINED PEPTIDE MOTIFS 

(iii) NUMBER OF SEQUENCES: 89 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Leydig, Voit & Mayer, Ltd. 

(B) STREET: Two Prudential Plaza, Suite 4900 

(C) CITY: Chicago 

(D) STATE: Illinois 

(E) COUNTRY: US 

(F) ?,IP: 60601 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
(U) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln. Release #1.0, Version " ff 1 . 30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER : WO 

(B) FILING DATE: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/701,124 

(B) FILING DATE: 21-AUG-1996 


(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino' acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

Arg Lys Lys Lys Arg Lys Lys Lys 
1 5 


(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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- (D) TOPOLOGY: linear 
(ii) lMOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 
Asp Tyr Lys Asp Asp Asp Asp Lys 


(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
Cys Asp Cys Arg Gly Asp Cys Phe Cys 


(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Cys Xaa Cys Arg Gly Asp Cys Xaa Cys 
1 5 


(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



Xaa 
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Xaa Xaa Xaa Xaa Xaa Xaa Cys 
20 


(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

TCA TAC ATT GCC CAA GAA TAAA 
Ser Tyr He Ala Gin Glu 
1 5 


(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Ser Tyr He Ala Gin Glu 

1 . 5 


(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
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TCA TAG ATT GCC CAA GAA GGA TCC AAT AAA 
Ser Tyr He Ala Gin Glu Gly Ser Asn Lys 
^ 5 10 


(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID 

Ser Tyr He Ala Gin Glu Gly Ser Asn Lys 

^ 5 10 


(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 43 base pairs 
(D) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TCCCCCCGGG TCTAGATTAG GATCCTTCTT GGGCAATGTA : 


(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CGTGTATCCA TATGACACAG A 

21 

(2) INFORMATION FOR SEQ ID NO: 12: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TATGGAGGAT CCAATAAAGA ATCGTTTGTG TTATGTTTCA ACGTGTTTAT TTTTC 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:13: 
AATTGAAAAA TAAACACGTT GAAACATAAC ACAAACGATT CTTTATTGGA TCCTCCA 

(21 INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GGCCATGGCC TAGAATTTGA TTCAAACGGT GCCATGATTA CTAAACTTGG AGCG 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:15: 
GCGGATCCTT ATTCCTGGGC AATGTAGGA 

(2) INFORMATION FOR SEQ ID NO: 16: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

ATT ACA CTT AAT GGC ACT AGT GAA TCC ACA GAA ACT 
lie Thr Leu Asn Gly Thr Ser Glu Ser Thr Glu Thr 
15 20 


(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

lie Thr Leu Asn Gly Thr Ser Glu Ser Thr Glu Thr 
^ 5 10 


(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

ACT AGA AAA AAA AAA CGC AAG AAG AAG ACT AGT 
Thr Arg Lys Lys Lys Arg Lys Lys Lys Thr Ser 
15 20 


(2) INFORMATION FOR SEQ ID N0:19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Thr Arg Lys Lys Lys Arg Lys Lys Lys Thr Ser 
1 5 10 


(2) INFORMATION FOR SEQ ID N0:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2'? base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CTAGAAAGAA GAAACGCAAA AAGAAGA 


(2) INFORMATION FOR SEQ ID NO: 21: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CTAGTCTTCT TTTTGCGTTT CTTCTTT 


(2) INFORMATION FOR SEQ ID NO:22: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

ACT AGA GAC TAC AAG GAC GAG GAT GAT AAG ACT AGT 
Thr Arg Asp Tyr Lys Asp Asp Asp Asp Lys Thr Ser 
15 20 


(2) INFORMATION FOR SEQ ID NO:23: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Thr Arg Asp Tyr Lys Asp Asp Asp Asp Lys Thr Ser 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CTAGAGACTA CAAGGACGAC GATGATAAGA 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: ' 
CTAGTCTTAT CATCGTCGTC CTTGTAGTCT 

(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 63 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 


ATT ACA CTT AAT GGC 
lie Thr Leu Asn Gly 
1 5 


ACT AGA AAG AAG AAA CGC AAA AAG AAG ACT ACT 
Thr Arg Lys Lys Lys Arg Lys Lys Lys Thr Ser 
10 15 


48 


GAA TCC ACA GAA ACT 
Glu Ser Thr Glu Thr 
20 


(2) INFORMATION FOR, SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

lie Thr Leu Asn Gly Thr Arg Lys Lys Lys Arg Lys Lys Lys Thr Ser 
1 5 10 15 

Glu Ser Thr Glu Thr 
20 


(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 66 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

ATT ACA CTT AAT GGC ACT AGA GAG TAG AAG GAC GAC GAT GAT AAG ACT 48 
lie Thr Leu Asn Gly Thr Arg Asp Tyr Lys Asp Asp Asp Asp Lys Thr 
15 10 15 

AGT GAA TCC ACA GAA ACT 66 
Ser Glu Ser Thr Glu Thr 
20 


(2) INFORMATION FOR SEQ ID NO: 29: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

He Thr Leu Asn Gly Thr Arg Asp Tyr Lys Asp Asp Asp Asp Lys Thr 

15 10 15 

Ser Glu Ser Thr Glu Thr 
20 


(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: -15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

ACT AGA GCC TGC GAC TGT CGC GGC GAT TGT TTT TGC GGT ACT AGT 
Thr Arg. Ala Cys Asp Cys Arg Gly Asp Cys Phe Cys Gly Thr Ser 
^5 10 15 


(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D)' TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Thr Arg Ala Cys Asp Cys Arg Gly Asp Cys Phe Cys Gly Thr Ser 
1 5 10 15 


(2) INFORMATION FOR SEQ ID NO: 32: 


(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
CTAGAGCCTG CGACTGTCGC GGCGATTGTT TTTGCGGTA 


(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

,(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
CTAGTACCGC AAAAACAATC GCCGCGACAG TCGCAGGCT 


(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS :. double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..39 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

ACT AGA TGC CGC CGC GAA AGO GCT TGG GCC TGT ACT AGT 
Thr Arg Cys Arg Arg Glu Thr Ala Trp Ala Cys Thr Ser 
1 5 10 


(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 
Thr Arg Cys Arg Arg Glu Thr Ala Trp Ala Cys Thr Ser 

- 5 10 - . 

t2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
CTAGATGCCG CCGCGAAACC GCTTGGGCCT GTA 33 

(2) INFORMATION FOR SEQ ID NO: 37 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
CTAGTACAGG CCCAAGCGGT TTCGCGGCGG CAT 33 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE:. 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .48 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

ACT AGA GAC ATT ACC TGG GAC CAG CTT TGG GAC CTT ATG AAG ACT AGT 4 a 

Thr Arg Asp He Thr Trp Asp Gin Leu Trp Asp Leu Met Lys Thr Ser 
^ 5 10 15 
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(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Thr Arg Asp He Thr Trp Asp Gin Leu Trp Asp Leu Met Lys Thr Ser 

5 ■ 10 15 


(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic. acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CTAGAGACAT TACCTGGGAC CAGCTTTGGG ACCTTATGAA GA 

(2) INFORMATION FOR SEQ ID NO: 41: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CTAGTCTTCA TAAGGTCCCA AAGCTGGTCC CAGGTAATGT CT 


(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA (genomic) 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4"2: 

ThI Arc f^' AAA AAA AAA AGA AAG AAG AAG ACT ACT 

Thr Arg Lys Lys Lys Arg Lys Lys Lys Arg Lys Lys Lys Thr Ser 
^ 10 15 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Thr Arg Lys Lys Lys Arg Lys Lys Lys Arg Lys Lys Lys Thr Ser 
^ 10 15 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
CTAGAAAGAA GAAGCGCAAA AAAAAAAGAA AGAAGAAGA 

(2) INFORMATION FOR SEQ ID NO:45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

CTAGTCTTCT TCTTTCTTTT TTTTTTGCGC TTCTTCTTT 
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(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 6: 
GATCAGGATC AGGTTCAGGG AGTGGCTCTG CCTGCGACTG TCGCGGCGAT TGTTTTTGCG 


(2) INFORMATION FOR SEQ ID NO: 47 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
GATCCTTAAC CGCAAAAACA ATCGCCGCGA CAGTCGCAGG CAGAGCCACT CCCTGAACCT 
GATCCT 


(2) INFORMATION FOR SEQ ID NO:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..72 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

GCC CAA GAA GGA TCA GGA TCA GGT TCA GGG AGT GGC TCT GCC TGC GAC 48 
Ala Gin Glu Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Ala Cys Asp 

15 10 15 


TGT CGC GGC GAT TGT TTT TGC GGT TAAGGATCCA ATAA 


86 
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Cys Arg Gly Asp Cys Phe Cys Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) . TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Ala Gin Gla Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Ala Cys Asp 
10 15 

Cys Arg Gly Asp Cys Phe Cys Gly 
20 


(2) INFORMATION FOR SEQ ID NO:50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
GATCCGGTTC AGGATCTGGC AGTGGCTCGA CTAGTTAAA 


(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:51: 
GATCTTTAAC TAGTCGAGCC ACTGCCAGAT CCTGAACCG 


(2) INFORMATION FOR SEQ ID NO: 52: 


(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 66 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1 . . 66 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52:^ 

ACT ACT GGA AGA GGA GAT ACT TTT GGC CGC GGC GAC ACG TTC GGA AGG 4 8 

Thr Ser Gly Arg Gly Asp Thr Phe Giy Arg Gly Asp Thr Phe Gly Arg 
15 10 15 

GGG GAT ACA TTT TCT AGT 66 
Gly Asp Thr Phe Ser Ser 
20 


(2) INFORMATION FOR SEQ IDNO:53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

Thr Ser Gly Arg Gly Asp Thr Phe Gly Arg Gly Asp Thr Phe Gly Arg 

1 ' 5 .10 15 

Gly Asp Thr Phe Ser Ser 
20 


(2) INFORMATION FOR SEQ ID NO:54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ -ID NO: 54: 
CTAGTGGAAG AGGAGATACT TTTGGCCGCG GCGACACGTT CGGAAGGGGG GATACATTTT 60 


(2) INFORMATION FOR SEQ ID NO:55: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CTAGAAAATG TATCCCCCCT TCCGAACGTG TCGCCGCGGC CAAAAGTATC TCCTCTTCC 


(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Glu He Leu Asp Val Pro Ser Thr 
1 5 


(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Glu He Leu Asp Val Pro Ser Glu He Leu. Asp Val Pro Ser Glu He 

1. 5 10 . 15 

Leu Asp Val Pro Ser 
20 


(2; INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1 . . 63 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

ACT ACT GAA ATT CTT GAC GTC GGA GAG ATC CTC GAC GTC GGG GAA ATA 
Thr Ser Glu lie Leu Asp Vai Gly Glu He Leu Asp Vai Gly Glu He 


CTG GAC GTC TCT AGT 
Leu Asp Vai Ser Ser 


(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Thr Ser Glu lie Leu Asp Vai Gly Glu He Leu Asp Vai Gly Glu He 
1 5 10 15 

Leu Asp Vai Ser Ser 


(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
CTAGTGAAAT TCTTGACGTC GGAGAGATCC TCGACGTCGG GGAAATACTG GACGTCT 


(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
CTAGAGACGT CCAGTATTTC CCCGACGTCG AGGATCTCTC CGACGTCAAG AATTTCA 57 


(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 66 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

ACT AGT GGA TAC ATC GGC AGT CGC GGT TAG ATT GGG TCC CGA GGA TAT 
Thr Ser Gly Tyr He Gly Ser Arg Gly Tyr He Gly Ser Arg Gly Tyr 

1 . 5 . 10 .15 

ATA GGC TCA AGA TCT AGT 
He Gly Ser Arg Ser Ser 
20 


(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Thr Ser Gly Tyr He Gly Ser Arg Gly Tyr He Gly Ser Arg Gly Tyr 
15 10 15 

He Gly Ser Arg Ser Ser 
20 
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(2) INFORMATION FOR SEQ ID NO: 64 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
CTAGTGGATA CATCGGCAGT CGCGGTTACA TTGGGTCCCG AGGATATATA GGCTCAAGAT 


(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 
CTAGATCTTG AGCCTATATA TCCTCGGGAC CCAATGTAAC CGCGACTGCC GATGTATCCA 


(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 

Tyr lie Gly Ser Arg 
1 5 


(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Tyr lie Giy Ser Arg Gly 
1 5 


(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 

Ser Ala Cys Asp Cys Arg Gly Asp Cys Phe Cys Cys Thr Ser 
1 5-10 


(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
He Thr Leu Asn Gly 


(2) INFORMATION FOR SEQ ID NO:70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Glu Ser Thr Glu Thr 

1 5 . 


(2) INFORMATION FOR SEQ ID NO: 71: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Phe Ser Tyr lie Ala Gin Glu 
1 5 


(2) INFORMATION FOR SEQ ID NO:72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser 
1 5 10 


(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..36 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

ATT ACA CTT AAT GGC ACT AGT GAA TCC ACA GAA ACT 
He Thr Leu Asn Gly Thr Ser Glu Ser Thr Glu Thr 
1 5 10 


(2) INFORMATION FOR SEQ ID NO:74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
(8) TYPE: amino acid 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
lie Thr Leu Asn Gly Thr Ser Glu Ser Thr Glu Thr 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 102 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

GCC CAA GAA GGA TCC GGT TCA GGA TCT GGC AGT GGC TCG ACT AGT GAA 
Ala Gin Glu Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Thr Ser Glu 
1 5 10-15 

ATT CTT GAC GTC GGA GAG ATC CTC GAC GTC GGG GAA ATA CTG GAC G-^C 
He Leu Asp Val Gly Glu He Leu Asp Val Gly Glu He Leu Asp Val 
20 25 30 

TCT AGT TAA 
Ser Ser 

(2) INFORMATION FOR SEQ ID NO:76: 

(i) SEQUENCE CHARACTERISTICS: 

!A) LENGTH: 34 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 

Ala Gin Glu Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Thr Ser Glu 
^ 5 10 15 

He Leu Asp Val Gly Glu He Leu Asp Val Gly Glu He Leu Asp Val 
20 25 30 

Ser Ser 
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(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 base pairs 
(9) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 


(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..105 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: 

GCC CAA GAA GGA TCC GGT TCA GGA TCT GGC AGT GGC TCG ACT AGT GGA 
Ala Gin Glu Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Thr Ser Gly 
1 5 10 15 

TAC ATC GGC AGT CGC GGT TAC ATT GGG TCC CGA GGA TAT ATA GGC TCA 
Tyr lie Gly Ser Arg Gly Tyr lie Gly Ser Arg Gly Tyr lie Gly Ser 


AGA TCT AGT TAA 
Arg Ser Ser 


(2) INFORMATION FOR SEQ ID NO:78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids . 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: 

Ala Gin Glu Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Thr Ser Gly 
1 5 . 10 15 

Tyr lie Gly Ser Arg Gly Tyr He Gly Ser Arg Gly Tyr He Gly Ser 
20 25 30 


Arg Ser Ser 
35 


(2) INFORMATION FOR SEQ ID NO:79: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEO ID NO:79: 

Ser Ala Cys Asp Cys Arg Gly Asp Cys Phe Cys Gly 
^5 10 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Glu lie Leu Asp Vai Pro Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Ser Gly Thr Val Gin 

1 ■ 5 . 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 


wo 98/07865 


93 


PCT/US97/14719 


Gly Ser Gly Ser Gly 
1 5 


(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

GGC AGT TTG GCT CCA ATA GGA TCC GGG TCT GGA AGT GCT CAT CTT ATT 
Gly Ser Leu Ala Pro He Gly Ser Gly Ser Gly Ser Ala His Leu He 
15 10 15 


(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Gly Ser Leu Ala Pro He Gly Ser Gly Ser Gly Ser Ala His Leu 
1 5 10 15 


(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
AATAAGATGA GCACTTCCAG ACCCGGATCC TATTGGAGCC AAACTGCC 


(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: 

Ser His Gly Lys Thr Ala 
1 5 

C2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

Ser Gly His Asn 
1 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 

Gly Ser Gly Ser Gly Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Gly Ser Gly Ser 

1 
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WHAT IS CLAIMED IS: 

1. A chimeric adenovirus fiber protein comprising 
a constrained nonnative amino acid sequence. 

2. The chimeric adenovirus fiber protein of claim 
1, wherein said chimeric adenovirus fiber protein more 
efficiently facilitates the entry into cells of a vector 
comprising said adenoviral fiber protein than a vector 
that is identical except for comprising a wild-type 
adenovirus fiber protein rather than said chimeric 
adenovirus protein. 

3. The chimeric adenovirus fiber protein of claim 1 
or 2, wherein said chimeric adenovirus fiber protein 
binds a binding site present on a cell surface which 
wild-type fiber protein does not bind. 

4. The chimeric adenovirus fiber protein of . any of 
claims 1-3, wherein said nonnative amino acid sequence 
comprises an epitope for an antibody or a ligand for a 
cell surface binding site. 

5. The chimeric adenovirus fiber protein of any of 
claims 1-4, wherein said nonnative amino acid sequence 
comprises from about 3 to about 200 amino acids. 

6. The chimeric adenovirus fiber protein of any of 
claims 1-4, wherein said nonnative amino acid sequence 
comprises from about 3 to about 30 amino acids. 

7. The chimeric adenovirus fiber protein of any of 
claims 1-4, wherein said nonnative amino acid sequence 
comprises a sequence selected from the group consisting 
of SEQ ID N0:1, SEQ ID N0:2, SEQ ID N0:3, SEQ ID N0:4, 
SEQ ID N0:5, SEQ ID N0:17, SEQ ID N0:19, SEQ ID NO:23, 
SEQ ID N0:31, SEQ ID NO:35, SEQ ID NO:39, SEQ ID NO:43, 


SEQ ID NO:49, SEQ ID NO:53, SEQ ID NO:56, SEQ ID NO:59, 
and SEQ ID NO:63, SEQ ID NO:66, SEQ ID NO:67, SEQ ID 
NO: 68, SEQ ID NO: 79, conservative amino acid 
substitutions thereof, and C- or N-terminal deletions 
thereof, wherein said deletions remove 1, 2, or 3 
residues. 

8. The chimeric adenovirus fiber protein of any of 
claims 1-7, wherein said fiber protein has a knob, said 
knob has one or more loops, and said nonnative amino acid 
sequence is inserted into, or in place of, the native 
amino acid sequence of a loop selected from the group 
consisting of the AB, CD, DG, GH, IJ, and HI loops of 
said knob. 

9. The chimeric adenovirus fiber protein of any of 
claims 1-8, wherein said nonnative amino acid sequence 
comprises a sequence selected from the group consisting 
of SEQ ID N0:3, SEQ ID N0:4, and SEQ ID N0:5, 
conservative amino acid substitutions thereof, and C- or 
N-terminal deletions thereof, wherein said deletions 
remove 1, 2, or 3 residues. 

10. The chimeric adenovirus fiber protein of any of 
claims 1-9, wherein said nonnative amino acid sequence is 
constrained by its possession of an RGD sequence and one 
or more cysteine pairs. 

11. The chimeric adenovirus fiber protein of any of 
claims 1-10, wherein said nonnative amino acid sequence 
is inserted into or in place of a protein sequence at the 
C-terminus of said chimeric adenovirus fiber protein. 


12. An isolated or purified nucleic acid encoding a 
chimeric adenovirus fiber protein of any of claims 1-11. 
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13. A transfer vector comprising the isolated or 
purified nucleic acid of claim 12. 

14. The transfer vector of claim 13, wherein said 
transfer vector is selected from the group consisting of 
pl93(F5*), pl93 F52K(RKKK2), pl93 F5F2K, 
pl93(F5)pGS(RGD) , pl93 (F5) pLDV, pi 93 ( F5 ) pYIGSR, 

pl93 (F5*) RGD, andpl93 F552K{FLAG). 

15. A vector comprising the chimeric adenovirus 
fiber protein of any of claims 1-11. 

16. The vector of claim 15, wherein said vector is 
a viral vector selected from the group consisting of 
nonenveloped viruses. 

17. The vector of claim 15 or 16, wherein said 
vector is an adenoviral vector. 

18. The vector of claim 17, wherein said vector is 
selected from the group consisting of AdZ.FLAG, 
AdZ.RKKK2, AdZ.pRGD, AdZ.RGD, AdZ.pLDV, and AdZ.YIGSR. 

19. The vector of any of claims 15-18, wherein said 
vector further comprises a passenger gene that is either 
inserted into the viral genome or is attached to a coat 
protein of said adenovirus by means of a protein/DNA 
interaction. 

20. A method of increasing the efficiency of entry 
into cells of a vector comprising a fiber protein, which 
method comprises replacing said fiber protein of said 
vector with the chimeric adenovirus fiber protein of any 
of claims 1-11. 
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21. A method of genetically modifying a cell which 
comprises contac-ing said cell with a vector of any of 
claims 13-19. 

22. A host cell comprising a vector of any of 
claims 13-19. 

23. A method of increasing the affinity of a 
peptide for a cell surface binding site which comprises- 

(a) obtaining a wild-type adenovirus fiber protein, and 

(b) xnserting into or in place of a protein sequence in 
a loop of said knob of said wild-type adenovirus fiber 
protexn a nonnative amino acid sequence so as to result 
m a chimeric adenovirus fiber protein. 

24. A method of increasing the affinity of an RGD 
sequence for a cell surface binding site which comprises - 

(a) obtaining a wild-type adenovirus fiber protein, and 

(b) inserting an RGD sequence into or in place of a 
protein sequence of said wild-type adenovirus fiber 
protein such that it is flanked by one or more cysteine 
pairs and is capable of forming a loop due to interaction 
between said cysteines. 
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